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ABSTRACT 


The  purpose  of  this  thesis  is  to  investigate  various  combinatorial 
and  probabilistic  properties  of  graphs  when  the  restriction  is  imposed 
that  the  points  of  the  graph  have  been  separated  into  two  distinguishable, 
mutually  exclusive  and  exhaustive  subsets  and  only  points  which  belong 
to  different  subsets  may  be  joined  by  an  edge0  Such  graphs  are  often 
called  bipartite  graphs. 

Expressions  for  the  counting  series  for  various  types  of  such 
graphs  are  obtained  in  Chapter  I. 

Bipartite  tournaments  are  considered  in  Chapter  II.  A  few  of 
their  properties  are  developed  and  certain  results  known  about  ordinary 
tournaments  are  extended  to  the  bipartite  case.  Ordinary  tournaments 
often  arise  in  connection  with  the  method  of  paired  comparisons  and 
bipartite  tournaments  also  provide  natural  models  for  certain  situations. 
Some  problems  associated  with  some  of  these  are  treated. 

In  Chapter  III  the  main  emphasis  is  upon  obtaining  a  few  typical 
results  of  a  structural  nature  for  bipartite  graphs.  These  can  be 
interpreted  as  expressing  some  structural  characteristic  of  a  graph  as 
a  function  of  the  number  of  points  and  edges  it  contains. 
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CHAPTER  I 

ON  SOME  COUNTING  PROBLEMS  FOR  BIGRAPHS 

1. 1  Definitions 

A  bipartite  graph  or,  for  the  sake  of  brevity,  a  bigraph,  consists 
of  two  finite  distinguishable  sets  of  things  called  points,  P  and  Q  , 
and  a  subset  E  of  the  set  of  all  ordered  pairs  (p,q)  ,  where  p  e  P 
and  q  €  Q.  The  elements  of  E  are  called  edges  and  p  and  q  are 
said  to  be  adjacent,  or  joined,  if,  and  only  if,  (p,q)  e  E.  The  edge 
(p,q)  is  incident  upon  the  points  p  and  q.  According  to  Konig  [ 57 3 ^ 
p.  170,  the  term  ’’bipartite",  or  "bipartie",  appears  to  have  been  used 
in  this  context  first  by  Sainte-Lague  [88],  Two  bigraphs  with  point  sets 
P  and  Q,  and  P*  and  Q’,  respectively,  are  defined  as  being  isomorphic 
if,  and  only  if,  there  exists  a  one-to-one  correspondence  between  the 
points  of  P  and  those  of  P',  and  between  the  points  of  Q  and  those 
of  Q’,  respectively,  which  preserves  all  adjacency  relations.  The 
number  of  points  in  P  and  Q  will  be  denoted  by  m  and  n  , 
respectively.  As  trivial  cases  we  shall  admit  the  possibility  that  m 
or  n  equals  zero  but  not  both.  By  the  degree  of  a  given  point  in  a 
given  bigraph  is  meant  the  number  of  edges  incident  upon  it.  As  general 
references  on  the  theory  of  graphs  see  Berge  [3]  and  Konig  [57l. 

In  this  chapter  we  discuss  how  some  of  the  methods  used  in 
enumerating  ordinary  graphs,  i.e.  graphs  in  which  no  distinction  is  made 
between  points  in  P  and  points  in  Q  as  defined  here,  may  be  applied 
to  the  counting  of  bigraphs  and  obtain  the  counting  series  for  various 


types  of  such  graphs 
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1.2  P&lya*  s  theorem 

Useful  in  many  enumeration  problems  is  a  theorem  due  to  Polya  [7^]. 
Using  the  terminology  of  Harary  [36]  a  statement  of  the  theorem  follows, 
restricted  to  the  case  of  one  variable,  which  will  suffice  for  roost  of 
our  purposes. 

We  take  a  figure  to  be  an  undefined  term  but  assume  that  each 
figure  may  be  characterized  in  some  fashion  by  a  single  nonnegative 
integer  called  its  content.  If  there  are  cp^  different  figures  of 
content  k  then  the  figure  counting  series,  cp(x)  ,  is  defined  by 

00 

(1.2.1)  cp(x)  =  ^  \  xk 

k=Q 

A  configuration  of  length  s  is  a  sequence  of  s  figures  and 
its  content  is  defined  as  being  the  sum  of  the  contents  of  the  figures 

of  which  it  is  composed.  If  G  is  some  permutation  group  on  s 

elements  of  order  h  then  two  configurations  of  length  s  are  said 
to  be  G-equivalent  if,  and  only  if,  there  is  a  permutation  of  G  ,  the 
configuration  group,  which  takes  one  into  the  other.  The  configuration 
counting  series,  F(x)  ,  is  defined  by 

CO 

(1.2.2)  F(X)  =  £  Fkxk  , 

k=0 

where  F^  denotes  the  number  of  G- inequivalent  configurations  of 
content  k. 

In  order  to  express  F(x)  in  terms  of  G  and  cp(x)  ,  the  object 
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of  the  theorem,  the  cycle  index  of  G  ,  Z(g)  ,  is  needed.  Let 

denote  the  number  of  permutations  of  G  of  type  (j)  =  (j^,  j  ,  jg), 

i.e.  having  j  cycles  of  length  k  ,  k  =  1,  2,  .  ..,  s.  Clearly 

(1.2.3)  l*j1  +  2*J2  +  ...  +  s*jg  =  s  . 

If  f^,  f^,  ...,  fg  are  indeterroinates  then  the  cycle  index  of 
G  is  defined  by 


(1.2.4) 


where  the  sum  is  over  all  partitions  (j)  satisfying  (1.2.3). 


For  example,  the  cycle  index  of  ,  the  symmetric  group  of 

permutations  of  n  elements, is  (see  e.g.  Riordan  [84],  p.  68) 


summed  over  all  partitions  (j)  satisfying  (1.2.3)  with  s  replaced 
by  n. 

For  any  function  f(x)  ,  let  Z(GJ  f(x))  denote  the  function 
obtained  from  Z(G)  upon  substituting  f(x^)  for  f^  ,  k  =  1,  2,  . . . ,  s  . 
Then  Polya's  theorem  may  be  stated  symbolically  as  follows! 


(1.2.6)  F(x)  =  Z(G;  cp(x) )  . 

Some  problems  to  be  treated  in  a  later  section  will  require  a 
slight  modification  of  this  theorem  with  a  corresponding  extension  of  the 
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definition  of  a  cycle  index  to  deal  with  the  case  where  more  than  one 
kind  of  figure  is  involved. 

1.3  The  number  of  bigraphs 

In  this  section  we  use  Polya's  theorem  to  determine  the  number  of 
nonisoroorphic  bigraphs  with  a  given  number  of  points  and  edges.  Most 
of  these  results  have  been  obtained  by  Harary  [42]  and  are  included 
here  for  the  sake  of  completeness. 

A  bigraph  may  be  regarded  as  a  configuration  whose  figures  are 
ordered  pairs  of  points,  the  first  point  of  each  pair  belonging  to  P 
and  the  second  point  of  each  pair  belonging  to  Q.  We  define  the 
content  of  such  a  figure  to  be  one  or  zero  according,  respectively,  as 
to  whether  the  two  points  forming  the  pair  are  or  are  not  joined  by  an 
edge  in  a  given  graph.  Since  these  are  the  only  possibilities  in  this 
case  the  figure  counting  series  will  be 

(1.3.1)  cp(x)  =  1  +  x  . 

The  content  of  the  graph,  being  the  sum  of  the  contents  of  its 
figures,  will  be  equal  to  the  total  number  of  edges  it  contains  and, 
hence,  may  range  between  0  and  mn  for  m  by  n  bigraphs. 

In  determining  if  two  m  by  n  bigraphs  are  isomorphic  any 
element  of  ,  the  symmetric  group  of  permutations  of  m  things,  may 

be  applied  to  the  points  in  P  ,  in  conjunction  with  the  application  of 
any  element  of  to  the  points  of  Q.  Hence  the  configuration  group, 

under  permutations  of  which  some  graphs  will  be  considered  equivalent, 
is  isomorphic  to  y  ,  the  direct  product  group  of  and  S^. 
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However  it  is  of  degree  run  since  the  combined  effect  of  permuting 


points  of  P  and  Q  among  themselves,  respectively,  is  the  permuting 
of  the  mn  ordered  pairs  of  points,  one  from  P  and  one  from  Q  , 
among  themselves. 

Denote  this  configuration  group  by  G  ;  then  upon  setting 

mn 

d.3.2)  g^x)  =  2,  W  xt  ' 

t=o 

where  g  ^  denotes  the  number  of  nonisomorphic  bigraphs  with  m 
mn,  t 

points  in  P  ,  n  points  in  Q  ,  and  t  edges,  Polya's  theorem  gives 
the  result  that 

d.3.3)  gran  (x)  =  z(Gmn  ,  l«)  . 


It  remains  to  obtain  an  expression  for  ^(^ron)*  To  accomplish 
this  we  first  observe  that  formally  multiplying  Z(S  )  by  Z(S^)  > 
both  of  which  may  be  regarded  as  known  by  (1.2. 5)^  exhibits,  with 
the  correct  coefficient,  all  the  combinations  of  cycle  types  which  can 
occur  when  permutations  of  are  applied  to  the  points  of  P  and 

permutations  of  are  applied  to  the  points  of  Q.  A  typical  term 

will  have  the  appearance 


) 


where  h!1!!  and  h/^x  denote  the  number  of  elements  of  S  and  S 
(j)  U)  m  n 

of  type  (j)  and  (i)  ,  respectively,  and  where  (j)  and  (i)  satisfy 
(1.2.3)  with  s  replaced  by  ro  and  n  ,  respectively. 
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To  determine  the  cycle  character  of  the  permutation  on  the 
figures  of  the  graph  induced  by  permutations  of  type  (j)  and  (i) 
being  applied  to  the  points  of  P  and  Q  ,  respectively,  we  consider 
any  figure  whose  first  element  lies  in  a  cycle  of  length  a  , 

1  <  a  <  m  ,  and  whose  second  element  lies  in  a  cycle  of  length  b  , 

1  <  b  <  n.  The  P  point  of  the  figure  first  returns  into  itself  after 
a  applications  of  a  given  permutation  of  type  (j)  to  the  points  of 
P  and,  similarly,  the  Q  point  of  the  figure  first  returns  into 
itself  after  b  applications  of  a  given  permutation  of  type  (i) 
to  the  points  of  Q.  Hence,  the  complete  figure  is  first  returned 
into  itself  after  [a,b]  simultaneous  applications  of  permutations 
of  type  (j)  and  (i)  to  the  points  of  P  and  Q  ,  respectively, 
where  [a,b]  denotes  the  least  common  multiple  of  a  and  b. 

Since  there  were  cycles  of  length  a  and  cycles  of 

length  b  in  the  respective  permutations  of  the  points  of  P  and  Q, 
it  follows  that  there  are  ja*a*^*b  figures  of  the  type  we've  just 
considered.  As  these  are  divided  into  cycles  of  length  [a,b]  it 
follows  that  there  are  j  i,  (a,b)  cycles  of  length  [a,b]  among 

3  D 

those  figures  whose  points  had  the  above  properties,  under  the 
application  of  permutations  of  type  (j)  and  (i)  to  the  points  of  P 
and  Q  ,  respectively,  where  (a,b)  denotes  the  greatest  common  divisor 
of  a  and  b. 

The  same  argument  applies  to  all  such  pairs,  a  and  b,  and 
yields  the  complete  cycle  characters  of  the  permutation  of  the  figures 
of  the  graph  induced  by  the  application  of  permutations  of  type  (j) 
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and  (i)  to  the  points  of  P  and  Q  ,  respectively,  since  the  points 
of  every  figure  lie  in  a  cycle  of  some  length  and  each  figure  is  counted 
exactly  one. 

Applying  the  same  procedure  to  each  of  the  terms  in  the  product 
of  Z(S  )  and  Z(S  )  gives  the  result  that 


n 


m 


where  the  sum  is  over  the  partitions,  (j)  and  ( H )  ,  described  above 


and  the  product  is  over  pairs  of  integers,  a  and  b  ,  such  that 
1  <  a  <  m  and  1  <  b  <  n. 


For  example,  one  finds  that 


and  replacing  f  by  1+x1  gives  the  corresponding  counting  series  for 
the  number  of  nonisomorphic  J  by  3  bigraphs. 


Using  Polya’s  theorem  the  first  few  nontrivial  counting  series 


for  bigraphs  are  found  to  be 


s2,2(x)  =  1  +  x  +  3x2  +  ...  , 


2  5 

s3  2^  =  1  +  x  +  3X  +  3X"'  +  ...  j 
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(1.3.5) 


gM(x) 

g4,4(x) 


g5,2(x) 

&3(x) 

g5,4M 


-  8  - 

2  -  ^  4  5  6 

1  +  x  +  3x  +  OX-'  +  llx  +  13x^  +  17x  +  ...  , 

1  +  x  +  3x2  +  6x"  +  l6x^  +  2  lx''  +  39x<^  +  44x7 

_c  6 

+  55x  +  ...  , 

1  +  x  +  3x  +  3x^  +  6x  +  6x-'  +  . . .  , 

1  +  x  +  3x2  +  6xJ  +  llx^  +  l8x""  +  26x°  +  29x7  +  ...  , 

1  +  x  +  3x2  +  6x5  +  1 6x^  +  2rJx':  +  53X6  +  80x7 

+  120x^'  +  l40x9  +  159x10  +  . . .  , 

1  +  x  +  3x2  +  6x-  +  l6x^  +  34x^  +  69x°  +  130x7 
+  234x6  +  367x9  +  527xl°  +  63hxU  +  755x12  +  ...  , 


where  the  remaining  terms  in  each  series  may  be  obtained  by  symmetry 
since  there  is  a  one-to-one  correspondence  between  the  nonisomorphic 
graphs  with  t  edges  and  those  with  mn-t  edges  induced  by  the  process 
of  complementation,  i.e.  forming  a  new  graph  in  which  two  points,  one 
of  which  is  in  P  and  the  other  is  in  Q  ,  are  joined  by  an  edge  if, 
and  only  if,  they  are  not  joined  in  the  original  graph. 


Thus,  for  example,  while  there  are  (2^)  =  5^200,300  5^5 

bigraphs  with  12  edges,  the  number  of  ways  of  selecting  the  12  edges 
from  the  25  that  are  possible,  from  (1.3.5)  we  see  that  these  belong 
to  only  755  essentially  different  types. 


Following  Harary  [38]  a  bigraph  of  strength  s  may  be  defined 
as  one  in  which  there  may  be  up  to  s  indistinguishable  edges  joining 
a  point  in  P  to  a  point  in  Q  5  it  is  of  type  t  if  there  are  t 


distinguishable  kinds  of  edges  available  for  joining  a  point  in  P  to 
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a  point  in  Q  where  no  more  than  one  edge  of  any  one  kind  joins  two 
points.  These  two  definitions  may  be  combined  to  define  a  bigraph  that 
is  both  of  strength  s  and  type  t. 


The  definition  of  isomorphism  of  two  bigraphs  given  previously 
may  be  extended  in  a  straightf orward  manner  to  include  these 
possibilities.  The  enumeration  of  the  number  of  nonisomorphic  graphs 
of  these  generalized  types  is  accomplished  in  the  same  way  as  that  of 
the  ordinary  ones  with  the  exception  that  the  figure  counting  series, 
for  obvious  reasons,  are  now 


(1.3.6) 


1  +  x  + 


(1  +  x)fc  , 


and 


(1  +  x  +  ... 


> 


respectively.  Setting  s  =  t  =  1  gives  (l.J.l)  again. 

A  bigraph  is  said  to  be  directed  if  for  each  edge  (p,q)  one 
of  the  points,  p  or  q  ,  is  specified  as  being  the  initial  point  and 
the  other  point  is  specified  as  being  the  terminal  point  of  the  edge 
and  it  is  further  permitted  that  two  directed  edges  may  join  p  and 
q  such  that  p  is  the  initial  point  of  one  of  them  and  q  is  the 
initial  point  of  the  other.  Two  directed  bigraphs  are  isomorphic  if 
they  satisfy  the  previous  definition  of  isomorphism  with  the  additional 
requirement  that  the  sense  of  direction  of  all  edges  is  preserved  in 
the  correspondence. 
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By  the  outdegree  of  a  point  in  a  directed  graph  is  meant  the 
number  of  edges  in  the  graph  having  it  as  the  initial  point.  The 
indegree  of  a  point  has  a  similar  definition. 

The  counting  of  nonisomorphic  directed  bigraphs  proceeds  along 
lines  similar  to  the  previously  considered  case.  The  only  essential 
difference  is  that  the  configuration  group  is  now  of  degree  2mn  which 
results  in  the  fact  that  the  cycle  index  for  the  directed  case  can  be 
obtained  from  that  in  (1.3.4)  by  increasing  the  exponent  of  each 
indeterminate  by  a  factor  of  two.  An  oriented  graph  is,  by  definition, 
a  directed  graph  in  which  not  more  than  one  oriented  edge  joins  any 
pair  of  distinct  points.  It  is  not  difficult  to  see  (cf,  Harary  [40]) 
that  the  counting  of  nonisomorphic  oriented  bigraphs  may  be  effected 
by  substituting  into  the  cycle  index  in  (1.3.4)  the  figure  counting 
series 

(1.3.7)  1  +  2x  . 

Many  other  types  of  bigraphs  may  be  counted  in  a  similar  fashion 
by  performing  appropriate  modifications  to  the  cycle  index  and  the 
figure  counting  series.  We  may  observe  that  for  bigraphs  there  is, 
in  a  sense,  a  certain  amount  of  overlap  between  the  concepts  of  graphs 
of  strength  s  and  type  t  and  directed  and  oriented  graphs  which 
does  not  carry  through  for  more  general  types  of  graphs. 

The  expression  given  in  (1.3.4)  for  the  cycle  index  of 
is  contained  in  Harary  [42]  as  equation  (7).  In  that  paper  when  m  =  n 
the  sets  P  and  Q  are  permitted  to  be  interchanged  in  determining 
whether  two  bigraphs  are  isomorphic.  To  find  the  cycle  index  for  this 
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case  requires  the  use  of  a  concept  known  as  group  exponentiation  which 
we  shall  not  discuss  here.  Also,  instead  of  having  just  two  sets  of 
points  such  that  only  points  in  different  sets  may  be  joined,  the  more 
general  situation  where  there  may  be  an  arbitrary  number  of  such  sets 
is  mentioned. 


Using  an  application  of  Polya's  theorem  described  in  [38],  which 
we  shall  use  in  §1.6,  Harary  and  Prins  [46]  have  obtained  the  counting 
series  for  such  graphs  which  are  connected, whereby  a  connected  graph 
is  meant  one  such  that  for  every  ordered  pair  of  distinct  points, 

(a,b)  ,  there  exists  a  path  from  a  to  b  ,  i.e.  a  sequence  of  distinct 
points  starting  with  a  and  ending  with  b  such  that  successive  points 
in  the  sequence  are  adjacent#.  They  also  have  counted  the  number  of 
nonisomorphic  bicoloured  graphs  with  no  isolated  points,  where  a 
bicoloured  graph  is  an  ordinary  graph  whose  points  have  each  been 
given  one  of  two  colours,  but  not  both,  such  that  each  edge  joins  two 
points  of  different  colour.  An  isolated  point  is  a  point  of  degree 
zero  in  the  present  context. 


If  g  (x)  denotes  the  counting  series  for  nonisomorphic  m 

ID  y  H 

by  n  bipartite  graphs  with  no  isolated  points,  then  it  is  easily  seen, 
for  m,n  >  1  ,  that 


(l-3-8>  S„,,n<x>  =  8ra,n(x)  -  gn,-l,nW  ’  gn,n-l(x)  +  Vl,n-l(x) 


Denoting  the  points  of  P  and  Q  by  P^,  P^,  ...  ,  P^  and 
Qf,  Qg,  ...,  Qn  ,  respectively,  one  may  say  that  two  such  labelled 
graphs  are  isomorphic  if,  and  only  if,  they  are  identical.  Lee  [62] 
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has  counted  the  number  of  labelled  bipartite  graphs  with  no  isolated 
points. 

Read  [78]  and  Austin  [2]  have  counted  the  number  of  labelled 
k-partite  graphs,  i.e.  graphs  with  k  distinct  sets  of  labelled  points 
such  that  no  point  is  adjacent  to  another  in  the  same  set  as  itself. 
Wright  [92]  has  investigated  the  asymptotic  behavior  of  these  numbers. 
Gilbert  [30]  has  considered  a  related  problem  when  k  =  2  where, 
among  other  things,  the  edges  are  labelled  as  well  and  there  is  no 
restriction  as  to  how  many  edges  may  join  two  points. 

1.4  The  number  of  rooted  bipartite  trees 


By  an  m  by  n  (bipartite)  tree  is  meant  an  ro  by  n  bigraph 


which  is  connected  and  acyclic,  i.e.  contains  no  cycles  or  nontrivial 
paths  whose  initial  and  terminal  points  coincide.  An  m  by  n  tree 
in  which  one  P  or  Q  point  is  singled  out  and  distinguished  from  all 
the  other  points  will  be  called  a  P  or  Q  rooted  tree,  respectively. 
Two  rooted  trees  are  isomorphic  if,  and  only  if,  they  are  isomorphic 
in  the  sense  of  the  definition  in  §1.1  and,  furthermore,  their  roots 
correspond. 


Let  r 


(P) 


and  rVN'  denote  the  number  of  nonisomorphic  m  by 
mn 


ran 


n 


P  and  Q  rooted  trees,  respectively.  Then 


In  this  section  we  indicate  how  the  value  of  these  numbers  may 


be  obtained 


,  ■ 


. 

. 


Consider  any  id  by  n  P  rooted  tree  in  which  the  root  is  of 


degree  £  ,  where  1  <  £  <  n.  Such  a  tree  may  be  formed  from  K 

0;  1 

0  by  1  Q  rooted  trees,  K  1  by  1  Q  rooted  trees,  K  0  1  by  2 

1^ 1  lyd 

Q  rooted  trees,  . ..  ,  K  j  by  h  Q  rooted  trees,  ...  ,  and 

11 

K  .  m-1  by  n  Q  rooted  trees,  where 

m-l,n  J  ’ 


+  K 


m-l,n 


(1.4.2)  0  •  K  .  +  1  *  K_  .  +  1»  K1  ^  +  ...  +  j*K.  ,  +  ... 

u,  i  1,1  1,2  j,n 

+  (m-l)  K  ,  =  m-1  , 

'  m-l,n 

and 


1  * 1  * K.  i  +  2 * K.  _  +  ...  +  h*K.  -  +  ... 

0,1  1,1  1,2  J,  h 


+  n  K  =  n  , 

m-l,n  7 


by  joining  each  of  the  roots  of  the  £  Q  rooted  trees  to  a  new  P 
point  and  calling  this  added  point  the  root  of  the  m  by  n  tree  thus 
constructed. 

Counting  the  number  of  different  ways  in  which  this  may  be  done 

gives 


(1.4.3) 


(P) 

c 

m,n 


n 


r^  +  K  -  1 

Jbh 


j=Q, . . .  jtn-1 
h=l,...,  n 


K 


where  the  sum  is  over  £  =  1,  2,  . ..,  n  and  over  all  partitions,  (k)  , 
satisfying  (1.4.2)  ,  since  the  number  of  ways  of  selecting  k  things 


from  a 


*  x*  +  Ic  J 

set  of  r  with  repetition  permitted  is  (  ^  ).  (See, 


e.g.  Riordan  [64],  p.  7.)  As  r^  =  r^ 


n,m 


this  permits  the 


-  -  c 
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numbers  to  be  computed  recursively.  Letting 


00 


(x,y) 


(1.4.3)  becomes 


(1.4.4)  r^  (x,y)  =  x  [|  (1  -  yh) 

j=0 

h=l 


■  I 


.(P) 


m;n 


ro  n 
x  y 


r(Q)  (y,x) 


m=l 

n=0 


CO 


-r 


(Q) 


If  r  denotes  the  number  of  nonisomorphic  ordinary  rooted 
trees  on  t  points  it  is  easily  seen  that 


(P) 

r  =  r 
m,n  1 


m+n=t 


C(Q) 

m,n 


m+n=t 


Hence, 


00 


(1.4.5)  r(x)  =  ^ 


rt  X 


t  =  l 


=  r^P^(x,x)  =  x  j(  (l-xj)  j  , 

j=l 


.  -r  . 


which  is  a  form  due  to  Cayley  [7]. 


Formally  taking  the  logarithm  of  (1.4.4)  gives 


CO 


log  r^(x,y) 


=  log  x  +  ^  r^  log ( l-x^  y11)"1 

j=0 

h=l 

00 

1  X”1  (Q)  (  j  h  2j  2h  /.  \ 

=  log  x  +  )  r  ^'  (xJ  y  +  x  J  y  /2  +  . . .  ) 

j=0 

h=l 

=  log  x  +  r^(x,y)  +  r^\x2  ,y2  )/2  +  ...  , 


or 


. 


. 


-- 

. 
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(1.4.6)  r^(x,y)  =  x  exp[r^(x,y)  +  r^(x2 ,y2 )/2  +  ...]  , 


since  these  are  only  generating  functions  and  convergence  is  immaterial. 

Setting  x  =  y  in  this  gives  another  analogous  property  of  the 
generating  function  for  the  number  of  ordinary  rooted  trees. 

Following  Riordan  [64],  p.  135>  i-n  his  treatment  of  the  same 

problem  for  ordinary  rooted  trees,  an  alternate  approach  is  to  apply 

Polya's  theorem,  where  two  variables  are  involved,  to  a  configuration 

which  consists  of  a  P  point,  the  root,  joined  to  the  roots  of  £ 

Q  rooted  bipartite  trees.  The  figure  counting  series  is  simply 

r^^(x,y)  and  since  the  £  trees  may  be  permuted  in  any  fashion  the 

cycle  index  is  Z(S^).  The  root  contributes  an  additional  P  point  so 

the  configuration  counting  series  is,  by  Polya's  theorem, 

x  Z(S  ;  r^^(x,y)).  Summing  over  £  and  using  the  identity  (cf. 
i * 


Riordan  [64],  p.  66) 


CO 


CO 


£=1 


2=1 


The  first  few  terms  of  r 


(1.4.8)  r(P)(x,y) 


2  3  4 


.  • 


. 
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1.5  A  dissimilarity  characteristic  theorem  for  bipartite  trees 

The  set  of  all  transformations  of  a  bigraph  into  itself  which 
preserve  all  adjacency  relations  and  the  property  of  a  points 
belonging  to  P  or  Q  ,  respectively,  form  a  group  which  is  called 
the  automorphism  group  of  the  graph.  (See  Konig  [57];  P.  5.)  Two 
points  of  a  graph  are  said  to  be  similar  if,  and  only  if,  there  exists 
an  element  of  the  automorphism  group  of  the  graph  which  takes  one 
point  into  the  other.  Similarity  of  edges  is  defined  in  the  same  way. 
The  relation  of  being  similar  is  an  equivalence  relation  and  separates 
the  points  and  edges  of  a  graph  into  mutually  exclusive  and  exhaustive 
classes  of  similar  points  and  edges,  respectively.  To  facilitate  the 
counting  of  the  number  of  nonisomorphic  bipartite  trees  in  terms  of 
the  number  of  nonisomorphic  rooted  bipartite  trees  we  prove  a 
dissimilarity  characteristic  theorem  for  bipartite  trees.  The  proof 
follows  that  given  by  Otter  [71]  for  a  corresponding  result  for 
ordinary  trees  with  some  simplification  arising  from  the  fact  that 
no  similar  points  are  adjacent  in  a  bipartite  tree. 

Theorem  1.5.1.  The  dissimilarity  characteristic  of  any  bipartite 

tree,  the  number  of  classes  of  dissimilar  points  minus  the  number  of 
classes  of  dissimilar  edges,  equals  one. 

If  T  is  some  bipartite  tree  consider  a  subset  of  the  points  and 
edges  of  T  which  themselves  form  a  subtree,  T*  ,  such  that  no  two 
distinct  points  or  edges  of  T*  are  similar  to  each  other.  Let  a 
point  q  in  T*  be  joined  to  a  point  p  which  is  in  T  but  not  in 
T*.  If  there  exists  an  element  a  of  the  automorphism  group  of  T 
such  that  (p)a  =  p*  ,  where  p'  is  in  T*  ,  then  we  may  make  the 


. 

'  !i  . 

'•  . 

•  -  "  . 


. 


-  17  - 


following  assertion! 

(1.5.1)  (q)  a  =  q'  =  q  . 

The  removal  of  £  ,  the  edge  joining  p  and  q  ,  splits  T 
into  two  bipartite  trees ,  and  ,  from  the  definition  of  a 

bipartite  tree.  Also,  the  removal  of  £'  ,  the  edge  joining  p1  and 
q*  and  which  exists  from  the  definition  of  a  ,  splits  T  into  two 
bipartite  trees,  T^,  and  T^,.  If  q*  ^  q  ,  then  £'  is  not  in  T* 
since,  by  hypothesis,  there  were  no  distinct  similar  points  in  T*. 
Hence  T*  is  included  in  T  ,  ,  as  is  the  edge  £.  Clearly,  p'  , 
q’  ,  and  £'  are  in  T  .  Since  £  is  not  in  T  ,  it  follows  that 

q  q 

T  ,  is  included  in  T  .  Furthermore,  p*  is  in  T  but  not  in  T 

qf  q  q  q' 

But,  since  T  is  a  finite  configuration,  this  contradicts  the  fact 
that  (lq)a  =  Tqf  .  A  simple  diagram  makes  the  reasons  for  some  of  the 
statements  clearer. 

Now  let  T*  be  the  largest  subtree  of  T  ,  in  terms  of  the 
total  number  of  points  involved,  having  no  distinct  similar  points  or 
edges.  Then  any  point  of  T  which  is  not  in  T'  but  which  is  joined 
to  a  point  in  T‘  must  be  similar  to  some  point  of  T*.  By  induction 
it  follows  immediately  that  every  point  in  T  ,  not  in  T*  ,  is  similar 
to  one,  and  of  necessity  only  one,  point  in  T*.  Consider  any  edge, 

£  ,  which  joins  a  point  p  to  a  point  q  and  which  is  not  in  T*. 

Then  there  exists  an  element,  a  ,  of  the  automorphism  group  of  T 
such  that  (q)ct  is  in  T*.  If  (p)ct  is  in  T*  also  then  £  is 
similar  to  an  edge  of  T'.  If  not,  there  exists  another  automorphism, 

0  ,  such  that  ((p)a)£  is  in  T'.  By  (1.5. l)  ((q)a«)j3  =  (q)a  ,  so 


T 


* 


-  i 
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in  any  case  we  have  that  every  edge  of  T  not  belonging  to  T*  is 
similar  to  one,  and  only  one,  edge  of  T'.  Hence,  there  are  as  many 
dissimilar  classes  of  points  and  edges  in  T  as  there  are  points  and 
edges,  respectively,  in  T'.  But  T*  is  itself  a  tree  and  the  number 
of  points  it  contains  minus  the  number  of  edges  it  contains  equals  one, 
by  a  form  of  Euler's  theorem  about  polyhedra  due  to  Listing  [6 3].  (See 
also  Konig  [57]*  P.  51.)  This  suffices  to  prove  Theorem  1.5.1. 

The  theorem  may  also  be  deduced  as  a  special  case  of  the 
dissimilarity  characteristic  theorem  for  Husiroi  trees  or  ordinary 
linear  graphs  contained  in  Harary  and  Norman  [35]*  [36]*  and  [45], 

1.6  The  numb er  of  bipartite  trees 

To  determine  T^  ^  ,  the  number  of  nonisomorphic  m  by  n  trees, 

we  observe  that  the  total  number  of  dissimilar  points  minus  the  total 

number  of  dissimilar  edges  in  all  nonisomorphic  m  by  n  trees  is 

T  ,  by  Theorem  1.5.1.  The  first  quantity  is  simply  the  total  number 
ro,  n 

of  rooted  m  by  n  trees  and  the  second  is  the  number  of  ways  of 
forming  an  m  by  n  tree  by  joining  the  root  of  a  P  rooted  tree  to 
the  root  of  a  Q  rooted  tree.  Thus 


X  =  r(p)  +  r(Q)  -  V  r(P)  •  r(Q) 

m,n  m,n  m,n  j,h  m-j,n-h 

j=l, . . . ,m 
h=0, . . . ,n-l 

In  the  counting  series  form  this  becomes 


00 


(1.6.1)  T(x,y)  =  ^  Tn,n  xI“  ^  +  rW(x,y)-rtP)(x,y)-r(Q)(x,y), 

m,n=0 


0,0 


with  T 


=  0  by  definition.  Since  the  right  member  involves  quantities 


I  i  .  .  * 

. 

. 
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that  may  be  regarded  as  known  this  enables  the  left  member  to  be 


evaluated  also.  One  finds  the  first  few  terms  to  be  as  follows! 


If  t(x,y)  is  the  counting  series  for  acyclic  bigraphs 


i.e.  bigraphs  each  of  whose  connected  components  is  a  bipartite  tree, 


then  by  applying  Polya’s  theorem  to  a  configuration  whose  figures  are 
bipartite  trees  and  using  the  identity  in  (1.4.7)  one  has  that 


00 


£=1 

A  corresponding  result  may  be  written  for  acyclic  bigraphs  each 
of  whose  connected  components  is  a  rooted  bipartite  tree. 


Riordan,  in  [8 J]  and  [85] }  and  Harary  and  Prins  [44]  have  counted 


ordinary  trees  in  terms  of  several  different  parameters,  the  bipartite 
analogues  of  which  could  also  be  considered.  Riordan,  in  the  first 
paper  above,  counts  the  number  of  trees  on  n  points  whose  points  may 
be  coloured  in  c  ,  an  arbitrary  positive  integer,  colours  such  that  no 
two  points  having  the  same  colour  are  adjacent  but  this  does  not  contain 
(l.6„ 1).  Also,  contained  in  these  papers  are  references  to  methods  of 
counting  ordinary  trees  which  do  not  make  use  of  a  dissimilarity 


. 
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and 

characteristic  theorem^which  could  undoubtedly  be  adapted  to  the 
bipartite  case,  although  they  probably  would  not  be  any  simpler  than 
the  method  used  here. 

1.7  The  number  of  labelled  bipartite  trees 

The  definition  of  a  labelled  P  or  Q  rooted  m  by  n  tree 

combines  the  definitions  of  a  labelled  m  by  n  bigraph  with  that  of  a 

P  or  Q  rooted  m  by  n  tree,  both  of  which  have  already  been  stated. 

(p)  (q) 

In  this  section  we  derive  formulae  for  R  and  R  ,  the  respective 


m,n 


m,n 


number  of  these,  where 


(1.7.1) 


r|P)  =  r^q|  =  1 
1,0  0,1 


and 


R.(P]  =  r(Q)  =  0 

0,1  1,0 


,(P) 


Let  R'  '  (£)  denote  the  number  of  labelled  P  rooted  m  by  n 
m,n  '  '  J 

trees  in  which  the  root  is  adjacent  to  precisely  £  other  points,  where 
1  <  i  <fl.  Each  such  bipartite  tree  may  be  formed  by  combining  £ 
labelled  Q  rooted  trees  by  joining  their  roots  to  an  added  P  point 
called  the  root,  where  the  total  number  of  P  and  Q  points  in  the 
combined  trees  is  m-1  and  n  ,  respectively.  Upon  counting  the 
number  of  ways  of  forming  these  trees  and  labelling  the  points  one 
finds  that 


(1.7.2) 


R(p)  U)  =  jr 
m,n  £\ 


(rc-1): 


n 


jii  ...  J®.*  hi’  ...  *•*  "  j h .  " 


(Q) 


R 


(Q) 


V  1 


'£’  £ 


where  the  sum  is  over  all  partitions,  (j)  and  (h)  ,  such  that 


+  j2  +  ...  +  j,  *  ™-i  > 


and 


. 

•  • 
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+  hg  +  . . .  +  -  n 

The  A\  factor  arises  from  the  fact  that  the  A  edges  joining 
the  root  to  points  which  were  previously  roots  may  be  permuted  in  any 
of  A\  ways. 


Then 


00 


00 


«•«>  I  *2 <»££•*(! TThT  1  ■ 


ro,n>l 


j=0 

h=l 


(Q), 


Denoting  the  sum  in  the  parenthesis  by  R'  '(x,y)  and 
(P) 

defining  R'  '(x,y)  similarly,  summing  over  A  implies,  since 

Ri!o  =  1  ’  chac 


(I.7.1*-)  =  X  +  k  R^(x,y)  +  x[R^',(x,y)J‘i  /  2.'  + 


=  x  exp  R^^(x,y)  . 


R^\x,y)  =  R^^(y,x)  and  Lagrange’s  formula  may  be  used  to 
determine  the  coefficients  in  these  series.  This  is  the  approach  used 
by  Austin  [2]  and  also  by  Scoins  [90]  who  has  recently  rederived  some 
of  the  results  of  Austin. 


An  alternate  procedure  consists  in  modifying  a  method  used  by 

Clarke  [9]  to  derive  the  formula,  due  originally  to  Cayley  [6],  that 
n_2 

there  are  n  ordinary  labelled  trees  on  n  points.  Using  Clarke’s 
terminology  we  may  proceed  as  follows. 


In  the  present  context  rooted  trees  in  which  the  root  is  of 
degree  A  are  said  to  be  of  type  A .  An  ordered  pair  of  labelled 


•- 
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m  by  n  P  rooted  trees,  (X,Y)  ,  where  X  is  of  type  £-1  and  Y  is 
of  type  £  ,  is  called  a  linkage  if,  and  only  if,  Y  can  be  obtained 
from  X  by  choosing  one  of  the  n-(i-l)  Q  points  not  adjacent  to 
the  root  and  replacing  the  edge  which  goes  from  the  point  towards  the 
root  by  an  edge  joining  the  point  to  the  root  directly.  That  such  an 
edge  exists  and  is  unique  follows  from  the  fact  that  trees  are 
connected  and  acyclic  by  definition.  The  total  number  of  linkages  for 
a  given  value  of  £  is  then 

(1.7.5)  In-U-l)]  •  (i-1)  . 


The  removal  of  the  root  of  Y  and  the  £  edges  joining  it  to 
other  points  leaves  £  distinct  subtrees.  Let  there  be  t^  points  of 
P  in  the  ith  such  tree,  for  i  =  1,  2,  ...,  £.  Clearly 
ti  +  tg  +  ...  +  t^  =  ro-1.  Now,  all  the  bipartite  trees  of  type  £-1 
linked  to  Y  may  be  obtained  by  removing  one  of  the  £  edges  incident 
upon  the  root  of  Y  and  replacing  it  by  an  edge  joining  the  same  Q 
point  to  a  P  point  not  in  the  same  subtree  as  it  is.  Counting  the 
number  of  ways  of  doing  this  gives  another  expression  for  the  total 
number  of  linkages  for  a  given  value  of  £  ,  namely 

(1.7.6)  [(m-l-t1)  +  (m-l-tg)  +  ...  +  (m-l-t^)]  (i)  =  (i-l) (m-l)R^ 


U)  . 


Equating  the  expressions  in  (1.7.5)  and  (1.7.6)  and  using 
the  rather  obvious  fact  that  R^^  (n)  =  inn™  ^  implies  the  following 
result! 


(1.7.7)  (e)  =  TO.”"1  (£*)  (m-l)(n'l)'(^1))  l  =  1,  ...,  n  . 


f 


. 

. 


• 

. 

•  . 
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For  ro,n  >  1  every  labelled  P  rooted  id  by  n  tree  is  of  one 
of  these  types,  so  upon  summing  over  &  we  have  that 

,,  o\  .,(P)  n-1  m-1 

(1.7.8)  Rv  '  =  m  m  n 

'  1  '  m,n 

An  immediate  consequence  is  that  if  R  denotes  the  number  of 

m,n 

labelled  m  by  n  trees,  m,n  >  1  ,  then 
fl  r -r  r^\  n  n-1  m— 1 

(1.7.9)  R  _  =  m  n 

ro,n 

By  other  methods  Austin  [2]  has  counted  what  could  be  called 
labelled  k-partite  trees.  His  formula  includes  Cayley’s  result  as  a 
special  case  when  each  of  the  k  sets  of  points  contains  only  a  single 
point.  However  there  appear  to  be  some  inherent  difficulties  in 
trying  to  extend  the  approach  used  here  to  the  more  general  situation. 

Let  F  (k)  denote  the  number  of  labelled  m  by  n  bigraphs 
m,  n 

which  consist  of  k  disjoint  bipartite  trees  such  that  Q^,  Q^,  ..., 
all  belong  to  separate  trees,  where  m  >  1  and  n  >  k  >  1.  After 
deriving  the  solution  to  the  analogous  problem  for  ordinary  graphs 
originally  stated  without  proof  by  Cayley  [8],  Renyi  [6l]  remarks  that 
it  can  be  used  to  derive  the  result  corresponding  to  (1.7.7)  given 
by  Clarke  [9]  for  ordinary  graphs.  The  converse  statement  holds  also 
and  in  our  case  we  have 

h-T.W)  RilU{k)  =  (m+l)  O  Fm,n  (k)  ’ 

since  each  graph  counted  in  the  left  member  may  be  obtained  by  choosing 
the  label  for  the  root  in  one  of  m+1  ways  and  then  joining  this  root 


; 


:■  . 

- 

- 

■ 

- 
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to  the  points  . ..,  >  selected  in  1  of  (^)  ways,  for  each 

of  the  F  (k)  graphs  of  the  type  described.  With  (1.7.7)  this  gives 
tn  y  n 

(1.7.11)  F  (k)  =  k  mn‘k  iT"1  . 

Ill  y  II 

When  k  =  1  this  gives  (1.7.9). 

The  formula  for  the  similar  problem  with  respect  to  points  in  P 
may  be  obtained  by  symmetry. 

To  obtain  a  closed  formula  for  the  generalization  of  this 
problem  to  where  points  in  both  P  and  Q  are  specified  as  belonging 
to  different  trees  appears  to  be  more  difficult  although  it  is  easy  to 
write  an  expression  for  this  in  terms  of  the  formulae  just  derived. 

1.8  The  number  of  connected  bigraphs  containing  one  cycle 

After  trees  the  simplest  connected  graphs  are  those  which  consist 
of  a  single  cycle  with  trees  attached  to  various  points  of  the  cycle. 

In  this  section  we  describe  how  to  determine  the  number  of  bigraphs  of 
this  type. 

Such  a  graph  may  be  considered  as  a  configuration  whose  figures 
are  rooted  trees  whose  root  points  are  on  a  cycle  of  length  2k  ;  k  an 
arbitrary  integer  >  2,  where  by  the  length  of  a  cycle  is  meant  the 
number  of  edges  it  contains.  The  figure  counting  series  is  the  counting 
series  for  rooted  bipartite  trees;  but  this  depends  upon  whether  the 
root  point  is  in  P  or  Q.  We  are  thus  presented  with  a  situation  in 
which  we  need  to  modify  Polya's  theorem  so  that  it  will  apply  to  a  case 
where  not  only  are  figures  characterized  by  their  content  but  there  are 
two  distinguishable  classes  of  figures  to  start  with. 


•  .  •  ■  '  .  '  ■■■ 
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Let  there  be  given  two  finite  distinct  sets  of  things  of 
cardinality  m  and  n  ,  respectively,  with  not  both  in  and  n  equal 
to  zero.  Let  G  be  a  subgroup  of  order  h  of  the  group  S^x 
which  permutes  members  of  these  two  sets  with  members  of  the  same  set? 
respectively.  We  define  a  modified  cycle  index  ,  z(G)  ,  by 


(1.8.1) 


Z(G) 


(j),(i) 


h(j)U) 


i 

n 

n 


) 


where  ^1(j)(<g)  denotes  the  number  of  elements  of  G  that  are  of  type 
(  j )  and  ( Jt, )  with  respect  to  the  elements  of  the  first  and  second 
set,  respectively,  f^,  ...,  f^  and  t^,  ...,  t^  are  indeterminates 
and  the  summation  is  over  the  same  partitions  as  in  (1.5.^-). 


Now  suppose  the  things  in  the  first  set  to  be  figures  of  one 
kind  and  those  in  the  second  set  to  be  figures  of  another  kind. 

Defining  equivalence  with  respect  to  G  of  two  configurations  each 
consisting  of  m  figures  of  the  first  kind  and  n  figures  of  the 
second  kind  in  a  way  similar  to  the  definition  in  §1.2  we  may  write, 
for  reasons  similar  to  those  justifying  (1.2.6)  ,  that 

(1.6.2)  F(x)  =  Z(G  5  cp(l)(x),  cp(2)(x))  , 

where  F(x)  denotes  the  configuration  counting  series  and  cp^^(x) 

(2) 

and  cpv  y(x)  denote  the  figure  counting  series  for  figures  of  the 
first  and  second  kind,  respectively.  Corresponding  to  the  ordinary 
case^the  right  member  means  the  expression  obtained  by  substituting 
cp^^x1)  for  f  ,  i  =  1,  ...  ,  m  and  q/2^(xk)  for  \  ,  k  =  1,  ...  ,  n  , 
into  the  right  member  of  ( 1.8.1). 
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(1.8.1)  and  (1.8.2)  admit  of  further  generalization  to  the 


case  where  figures  are  characterized  with  respect  to  more  than  one 
parameter  -  two  in  the  problem  to  be  treated  here  -  and  to  where  there 
are  an  arbitrary  number  of  kinds  of  figures. 

In  the  counting  of  the  number  of  connected  bigraphs  with 
exactly  one  cycle  the  two  figure  counting  series  will  be  rv  '(x,y) 
and  r^\x,y)  ,  the  counting  series  for  P  and  Q  rooted  bipartite 
trees,  respectively.  We  need  to  determine  the  modified  cycle  index 
determined  by  the  symmetries  of  a  cycle  containing  k(>  2)  P  points 
and  k  Q  points  such  that  P  points  are  taken  only  into  P  points 
and  similarly  for  Q  points. 

For  an  ordinary  cycle  of  length  s  in  which  all  points  are  alike 
the  group  of  symmetries  is  generated  by  a  rotation, 


R  =  (1  2  ...  s)  , 


•  •  • 


and  by  a  reflection. 


x .  r  5  -  t  . 

\l  s  s-l  ...  2) 

the  numbers  referring  to  the  points.  Riordan  [84],  p.  150*  has  derived 

the  ordinary  cycle  index  for  this  group,  D  ,  the  dihedral  group,  and 

s 

for  s  =  2k  it  is 


where  cp(j)  denotes  the  number  of  integers  less  than  j  and  relatively 


c  . 
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prime  to  j  ,  with  cp(l)  =  1  ,  and  the  sum  is  over  the  divisors  of  2k. 

In  cycles  of  the  type  we  are  considering  all  the  even  numbered 
points,  starting  the  count  at  some  point,  say  in  P  ,  belong  to  Q 
and  all  the  odd  numbered  points  belong  to  P.  From  the  group  of  all 
symmetries  of  such  a  cycle  we  wish  to  retain  only  the  subgroup  of 
elements  which  take  P  points  into  P  points  and  similarly  with 
respect  to  Q  points.  Then  we  express  the  cycle  behavior  of  these 
elements  with  respect  to  P  and  Q  separately  in  the  form  of  a 
modified  cycle  index. 

The  summed  term  in  (l„8.3)  corresponds  to  the  rotations 
generated  by  the  powers  of  the  rotation  denoted  above  by  R.  It  is 
clear  that  the  admissible  transformations  of  this  set  will  be  those 
involving  an  even  power  of  R.  Since  the  cycle  behavior  of  such 
transformations  is  the  same  with  respect  to  P  and  Q  the  contribution 
of  these  to  the  corresponding  modified  cycle  index  will  be 

l  ><«  ‘T  ■ 

djk 

2  k-1 

The  term  k  f^  f^  in  the  expression  for  Z(D^^)  arises 
from  reflections  typified  above  by  T  ,  which  may  be  written  as 

T  =  (1)  (2,2k)(3,2k-l)  ...  (k,k+2)(k+l)  . 

Suck  reflections  are  all  admissible  in  our  case.  Whether  the  point 
labelled  k+1  is  in  the  same  set  as  the  point  labelled  1  depends 
on  whether  k  is  even  or  odd.  Considering  these  two  cases  separately 
it  is  not  difficult  to  see  that  the  contribution  of  such  reflections  to 


the  modified  cycle  index  will  be 


>  .. 


■. 
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k 

2 


2 

1 


k-2  k 


+ 


k-2 


) 


) 


or 


k-1  k-1 


k  f. 


\  C2 


according  as  to  whether  k  is  even  or  odd,  respectively. 


The  term  k  arises  from  reflections  typified  by 


T  R  =  (1,2)  (3,2k)  ...  (k,k+5)  (k+l,k+2)  . 


All  such  transformations  involve  the  interchanging  of  two  points,  the 
numbers  associated  with  which  are  of  different  parity.  Hence  they  will 
not  be  admitted  to  the  group  of  symmetries  which  preserve  the  identity 
of  the  sets  P  and  Q. 


Combining  the  preceding  expressions  we  have  that  Z(H^)  ,  the 
modified  cycle  index  determined  by  the  group  of  symmetries  of  a  cycle 
of  length  2k  formed  by  alternating  points  from  two  distinct  sets 
which  maintain  the  distinction  between  the  two  sets,  may  be  written  as 


(1.8.4) 


z =  1 


k-2  k  k  lc-2 


,k/d  k/d  k  ,2  _  2  2  k  .2  2  2 

cp(d)  fd'  t^  +  g  fi  f2  C2  +  2  f2  tl  2 


k-1  k-1 


cp(d)  f^/d  t^' d  +  k  f1  f 22  t1  tg2  ^  j 


according  as  to  whether  k  is  even  or  odd,  respectively 


- 
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Letting  C^(x,y)  denote  the  counting  series  for  the  number  of 
nonisomorphic  connected  bigraphs  containing  a  single  cycle,  and  that  of 
length  2k  ,  k  >  2  ,  it  follows  from  (1.8.2)  that 

(l.s.5)  Ck(x,y)  =  J  r(P)(x,y)  ,  r(^(x,y))  . 

The  first  few  terms  of  C0(x,y)  ,  the  simplest  case,  may  be 
found  to  be  as  follows! 

(1.8.6)  C2(x,y)  =  x^y2  +  x2y-'  +  2x2y^  +  2x2y-'  +  ... 

+  5x y  +  7x"y  +  lOx-V'  +  . . . 

+  19x  y  +  43x  y  +  ... 

+  ljSx-'V  +  . . . 

When  the  points  of  the  graph  are  labelled  the  corresponding 
problem  has  been  solved  by  Austin  [2], 

1.9  The  superposition  theorem  for  bigraphs 

Given  k  different  ordinary  graphs  each  on  n  points,  the 
superposition  theorem  due  to  Read  [76]  may  be  used  to  determine,  in 
terms  of  the  automorphism  groups  of  these  graphs,  the  number  of 
nonisomorphic  graphs  on  n  points  which  may  be  formed  by  identifying 
the  points  of  the  different  graphs  in  a  certain  manner.  The 
possibilities  that  more  than  one  edge  may  join  two  points  and  that  an 
edge  may  join  a  point  to  itself  are  admitted  and  using  this  theorem, 
and  in  conjunction  with  Polya's  theorem  in  a  subsequent  paper  [77 ]> 

Read  counted  several  types  of  graphs,  especially  those  in  which 

/ 

certain  restrictions  are  made  on  the  number  of  edges  that  may  be 


: 


•  4 


.  . 


-  . 

' 

. 

incident  on  various  points.  In  this  section  we  derive  the  corresponding 
result  arising  from  making  the  restriction  that  in  all  the  graphs  the 
points  are  partitioned  into  mutually  exclusive  and  exhaustive  classes 
in  such  a  way  that  in  the  superpositioning  process  only  points 
belonging  to  certain  classes  may  be  identified  with  each  other.  This 
will  be  defined  more  precisely  in  what  follows. 

Our  argument,  which  parallels  that  of  Read's  and  which  retains 
his  terminology  insofar  as  possible,  will  be  carried  out  only  for  the 
case  where  there  are  two  graphs  to  be  superposed  and  where  the  points 
in  each  have  been  split  into  two  classes.  However,  it  will  be  seen 
that  the  procedure  is  completely  general. 

Let  there  be  given  two  finite  sets  of  elements,  P  and  Q  ,  of 
cardinality  m  and  n  ,  respectively,  where  not  both  m  and  n  are 
zero.  Let  H.  be  a  subgroup  of  order  h^^  ,  i  =  1,  2,  of  S  x  S  , 
the  direct  product  group  of  the  symmetric  groups  of  degree  m  and  n. 
Consider  the  set  of  all  ordered  pairs,  (a^,  a^)  ,  of  elements  of 

S  x  S  . 

m  n 

Two  such  ordered  pairs,  (a^,  a^)  and  (b^,  b^)  ,  will  be  said 
to  be  L-similar  with  respect  to  and  ,  or  (a^,  a^)  L  (b^,  b^)  , 

if,  and  only  if,  b^  a^  ^  e  for  i  =  1,  2  .  They  will  be  said  to 
be  T-similar  with  respect  to  and  ,  or  (a^,  a^)  T  (b^,  b^)  , 

if,  and  only  if,  there  exists  an  element  x  of  S  S  such  that 
(a^  x,  a^  x)  L  (b^,  b^).  These  both  define  equivalence  relations  and 
we  wish  to  evaluate  the  number  of  equivalence  classes,  (M»N)^,  , 


determined  by  T 
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An  ordered  pair  is  T-similar  to  (a^,  a^)  if,  and  only  if,  it 

is  of  the  form  (h,  an  x,  hrt  a_  x)  ,  x  e  S  V  S  and  h.  e  H.  ,  for 

1122  m  n  1  x 

i  =  1,  2.  For  fixed  (a^,  a0)  there  are  h^^  h/2^  mi  nj  such  pairs 
and  we  first  determine  how  many  times  a  typical  one  is  repeated. 

If  (gx  ax  y,  g2  a2  y)  ,  y  €  S^x  S r  and  g£  e  ^  for 
i  =  1,  2  ,  is  the  same  ordered  pair  as  (h^  a^  x,  h^  a^  x)  ,  then 

(i.9.i)  yx_1  =  a[l  gT1  h±  a±  ’  i  =  1,  2  . 


This  implies  that  yx  ^  is  a  member  of  U(a^,  a^)  ,  the  group  of 
elements  common  to  the  groups  a^  a^  and  a2  .  Given  x  , 

the  tu  ,  and  an  element  of  U(a^,  ,  then  the  g_^  and  y  are 

determined  uniquely.  Hence  if  there  are  u(a^,  a2)  elements  in 
U(a^,  a2)  every  pair  T-similar  to  (a^,  a^)  will  occur  exactly 

u(a^,  a2)  times  in  the  above  set.  Thus  there  are 


(1.9.2) 


h^  h^  mj  ni 


u(aL,  a2) 


pairs  in  the  equivalence  class  to  which  (a..,  a  )  belongs, 

1  td 


The 


pairs  in  the  equivalence  class  with  (a^,  a2)  contribute 


(1.9.3) 


h^  h^  ml  n* 


u(ax,  a2) 


“  *  u(a1,  a2) 


to  ^  u(a^,  a2)  ,  summed  over  the  ordered  pairs  (a^,  a2).  As  this 
is  independent  of  the  pair  (a^,  a 2)  it  follows  that 


t 


- 
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(1.9.4)  ^  u(aV  *2)  =  ml  nl  (M*N)T  * 

To  evaluate  ^  u(a^,  a^)  we  need  to  determine  for  each  pair 
(a^,  a^)  the  number  of  pairs  of  elements  v^  e  and  Vg  e  Hg  such 
that 


(1.9.5) 


ai1  vi  ai 


-1 

^2  vg  ^2  • 


Or  we  may  choose  the  first  and  then  determine  the  number 

of  pairs  (a^,  a^)  satisfying  (1.9.5).  An  alternate  form  of  (1.9.5) 
is 

(1.9.6)  v  =  a^  a ^  Vg  a^  a^  =  b  ^  Vg  b  , 

where  b  =  ag  a^\  F°r  eac^  element  b  satisfying  (1.9.6)  there 
are  mj  nj  pairs  satisfying  (1.9.5)  5  namely  (a^,  b  a^)  for  each 

a1  €  S  X  &  • 

Iron 


Any  element  of  3^  may  be  interpreted  as  being  a  permutation 

of  the  elements  of  P  performed  in  conjunction  with  a  permutation  of 
the  elements  of  Q  ,  where  P  and  Q  have  been  .defined  earlier. 

Since  v^  and  Vg  are  conjugate  elements,  by  (1.9.6),  it  follows 

that  they  have  the  same  cycle  character  with  respect  to  the  elements 

of  P  and  Q  ,  respectively.  For  fixed  v^  and  vQ  ,  of  type 

(j)  =  ( Jp  ...  ,  3m)  with  respect  to  P  and  of  type  (&)  =  ...  ,  £^) 

with  respect  to  Q  ,  the  number  of  different  solutions,  b  to  (1.9.6) 

is  the  number  of  ways  of  writing  Vg  under  v^  ,  when  both  are  expressed 

as  a  product  of  disjoint  cycles,  in  such  a  way  that  all  the  cycles  of 


1  I 
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length  one  with  respect  to  P  come  first,  followed  by  the  cycles  of 
length  two  with  respect  to  P  ,  etc.,  and  similarly  for  the  cycles  of 
permutations  of  elements  of  Q  ,  or 


(1.9.7)  V 


i 

n 

n 


See  e.g,  Murnaghan  [69],  PP.  9-10. 

Multiplying  by  m.'  nj  for  reasons  already  given  we  see,  since 

there  are  h/^w.\  h/^w.v  pairs  of  elements  v.  €  H.  and  v_  e  HL 
(j)U)  (j)U)  112  2 

both  of  cycle  type  (j)  and  (i)  with  respect  to  P  and  Q  , 
respectively,  using  the  notation  of  §1.8,  that  the  total  contribution 
to  ^  u(a^,  a^)  of  the  v^'s  of  type  (j)  and  ($,)  is 

(1.9.6)  n:  h[*J(i)  T(j)w  T(J)(»)  , 

where  T^^(m)  =  j 
Summing  this  over  the  same  partitions,  (j)  and  (i)  ,  as  described 
for  (l.3.*0  and  using  (1.9.^-)  completes  the  proof  of  the  following 
lemma. 

Lemma  1.9.1.  (M-H)t  =  (l)  l2)  h(j)(i)  T(j)(m:i  T(i)(n)  ‘ 

h  h  (j)U) 

When  k- tuples  of  elements  of  S  %  S^  are  being  considered, 
k  >  2  ,  the  T's  are  raised  to  the  power  k-1  ,  besides  the  more 
obvious  changes,  since  there  would  be  k-1  equations  corresponding 
to  (1.9.8)  all  of  which  have  the  same  number  of  solutions  to  be 


1* 


m 


m 


m 


and  similarly  for 


T(i)(n) 


counted. 
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We  may  observe  that  the  modified  cycle  indices  of  the  groups 
and  ,  as  defined  in  (1.8. l)  contain  all  the  information 

necessary  to  use  this  lemma. 


We  now  develop  the  superposition  theorem  for  bigraphs. 


Let  and  be  two  m  by  n  bigraphs  each  of  whose 

point  sets  are  labelled  P  =  {P^,  ...  ,  P  )  and  Q  =  [Q^,  . ..,  Q^}  . 
A  superposition  of  G^  and  G^  is  defined  as  any  bigraph  which  may 
be  obtained  by  permuting  the  labels  of  the  points  of  G_^  according 
to  any  permutation 


a . 
1 


P 

m 


where  a^  £  y.  ,  for  i  =  1,  2,  and  then  identifying  all  points 
in  G^  and  G^  which  have  the  same  label.  In  the  superposed  graph 
thus  obtained  we  shall  assume  that  the  edges  which  come  from  G^  are 
distinguishable  from  those  which  come  from  G^  ,  say  from  having  a 
different  colour. 

Two  different  graphs  both  obtained  by  superpositioning  G^  and 
G^  will  be  called  similar  as  labelled  graphs  if,  and  only  if,  the  same 
number  of  edges  of  both  colours  join  P_^  to  in  both  graphs, 

i  =  1,  ...  ,m,  and  j  =  1,  ...  ,  n  J  they  will  be  called  isomorphic 
if,  and  only  if,  the  application  of  some  permutation  a  £  X  to 

the  labels  of  the  points  of  one  of  the  graphs  renders  them  similar  as 


labelled  graphs. 
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Those  permutations  of  X  which  when  applied  to  the  labels 

of  the  points  of  G_^  give  a  graph  similar  as  a  labelled  graph  to  G^, 

form  a  group,  the  automorphism  group,  EL  ,  of  G_^  ,  for  i  =  1,  2  . 

It  follows  that  the  two  graphs  obtained  from  G^  by  applying  the 

permutations  a^  and  b_^  will  be  similar  as  labelled  graphs  if,  and 

only  if,  b.  a.  ^  €  H.  ,  for  i  =  1,  2  . 
ii  l 

Any  other  graphs  obtainable  from  G^  and  G^  through  an 
admissible  permutation  of  the  labels  of  the  points  may  be  indicated 
by  specifying  a  pair  of  elements,  (a^,  a^)  ,  of  \  S^.  The 
superposed  graphs  corresponding  to  (a^,  a0)  and  (b^,  b0)  will  be 
similar  as  labelled  graphs  if,  and  only  if,  b_^  a^  e  IL  ,  for 
i  =  1,  2,  hence  if,  and  only  if,  (a^,  a^)  L  (b^,  b^). 

Furthermore,  the  superposed  graphs  corresponding  to  (a^,  a^) 
and  (b^,  bD)  will  be  isomorphic  if,  and  only  if,  there  exists  an 
element  x  €  such  that  the  graph  corresponding  to  (a^  x,  a^  x) 

is  similar  as  a  labelled  graph  to  the  graph  corresponding  to  (b^,  b^)  , 
hence,  if,  and  only  if,  (a^,  a^)  T  (b^,  b0).  But  the  number  of 
T-inequivalent  pairs  has  been  determined  already,  which  completes  the 
proof  of  the  superposition  theorem  for  bigraphs. 

Theorem  1.9.1.  The  number  of  nonisomorphic  ra  by  n  bigraphs 

obtainable  by  superposing  the  m  by  n  bigraphs  G^  and  G0  is  the 
number  (M.N)^  ,  of  Lemma  1.9.1  >  where  and  are  the 

automorphism  groups  of  G^  and  G^  respectively. 

In  a  similar  fashion  the  more  general  theorem  for  superposing 
an  arbitrary  number  of  bigraphs  comes  from  the  more  general  form  of  the 


■  •  ' 


lemma.  Corresponding  results  could  be  obtained  for  superposing 
n-partite  graphs,  and  the  theorem  is  equally  valid  when  the  restriction 
that  no  more  than  one  edge  may  join  two  points  in  different  subsets  is 
dropped. 

1. 10  The  number  of  functional  directed  bigraphs 

The  counting  series  for  the  number  of  nonisomorphic  functional 
digraphs,  i.e.  directed  graphs  in  which  each  point  is  of  outdegree  one, 
was  obtained  by  Harary  [4j]  and  later  simplified  by  Read  [793.  The 
method  involved  a  repeated  application  of  P6lya's  theorem  based  upon 
an  analysis  of  the  structure  of  the  digraph  in  terms  of  the  length  and 
number  of  the  cycles  it  contained.  (A  cycle  in  a  directed  graph  is  the 
same  as  a  cycle  in  an  undirected  graph  except  that  its  edges  are  all 
directed  in  the  same  sense  and  the  possibility  is  also  admitted  of 
there  being  cycles  of  length  two. )  Essentially  the  same  procedure, 
utilizing  the  results  of  §§  1.4  and  1.8,  would  suffice  to  treat  the 
corresponding  problem  for  bigraphs.  However,  in  this  section  we  shall 
pursue  an  alternate  approach  making  use  of  the  superposition  theorem 
for  bigraphs.  A  more  precise  formulation  of  the  problem  follows. 

Let  there  be  given  two  distinct  sets  of  points,  P  =  [P^,  ...,  P^ 
and  Q  =  {Q  ,  ...,  Qn)  ,  m,n  >  1  ,  such  that  from  each  point  in  P 
there  issues  one,  and  only  one,  edge  directed  towards  some  point  in  Q  , 
and  similarly  with  P  and  Q  interchanged.  Such  a  configuration  is 
called  a  functional  directed  m  by  n  bigraph  and  may  be  interpreted 
as  arising  from  an  ordinary  functional  digraph  in  which  the  points  are 
separated  into  two  disjoint  subsets  and  only  points  which  belong  to 


different  subsets  may  be  adjacent.  The  definition  of  isomorphism  of 
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two  such  graphs  follows  that  given  in  the  preceding  section  as  does  the 
definition  of  the  automorphism  group  of  the  graph. 

It  is  natural  to  consider  any  functional  directed  bigraph  as 
being  a  superposed  graph  in  which  the  edges  directed  towards  points  in 
Q  originate  from  one  graph,  call  it  ,  and  the  edges  directed 


towards  points  in  P  originate  from  a  second  graph,  call  it  . 

Hence,  by  Theorem  1.9. 1,  obtaining  the  modified  cycle  indices  of  H 


(1) 


(2) 

and  H  ,  the  automorphism  groups  of  G^  and  G?  ,  respectively, 

will  enable  us  to  determine  w  ,  the  number  of  nonisomorphic  m  by  n 

m,n  ’  r  J 

functional  directed  bigraphs. 


The  graph  G^  ,  containing  ro  directed  edges  one  from  each  P 
point  towards  a  point  of  Q  ,  may  be  characterized  up  to  an  isomorphism 
by  a  set  of  nonnegative  integers,  (g)  =  (gQ,  g  ,  ...,  g^)  ,  where  g 
is  the  number  of  points  of  Q  of  indegree  k.  It  follows  that 


(1.10.1) 


g0  +  §i  + 


+  g  =  n 
m 


and 


(1.10.2)  1  •  +  2  •  gg  +  ...  +  m  •  gm  =  m  . 


,(D 


It  is  clear  that  under  automorphisms  of  H  only  those  P 
points  whose  edge  is  directed  towards  the  same  Q  point  can  be  mapped 
into  each  other  and  that  any  point  of  Q  can  only  be  mapped  into 
another  point  of  Q  whose  indegree  is  the  same.  Hence,  to  determine 
H^^  ,  or  rather  its  modified  cycle  index,  we  may  first  consider  the 
automorphism  group  of  the  subgraph  consisting  of  the  g^  Q  points  of 
indegree  k  ,  the  k'g^  edges  directed  towards  these  points,  and  the 
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k*g,  P  points  from  whence  these  edges  are  directed,  for  0  <  k  <  m. 
The  case  k  =  0  will  be  trivial. 


Consider  a  g^  by  k  rectangular  array  in  which  the  rows 
correspond  to  the  g^  Q  points  of  indegree  k  in  some  order  and 
the  entries  in  the  rows  represent  the  k  P  points,  in  some  order, 
from  which  edges  are  directed  towards  the  Q  point  to  which  the  row 
corresponds.  The  group  of  the  subgraph  being  considered  then  consists 
of  any  automorphism  that  may  be  obtained  by  first  permuting  the 
entries  in  each  row  according  to  some  element  of  ,  the  symmetric 
group  of  degree  k  ,  not  necessarily  the  same  permutation  for  each  row, 
and  then  permuting  the  rows  themselves  according  to  some  element  of 
Sg^  ,  the  symmetric  group  of  degree  g^.  This  group  is  known  as  the 
composition,  Sg^[S^]  ,  of  Sg^  with  and  the  cycle  index,  with 

respect  to  the  application  of  elements  of  the  group  to  the  P  points 
involved,  is  obtained  by  "substituting"  the  cycle  index  of  into 

that  of  Sg^  ,  as  shown  in  Polya  ["jh],  P.  17&.  "Substituting"  means 
that  if  Z(Sg^)  =  U(cp^,  ...,  cp^)  then  cp^  is  replaced  by 

V(0i;  92V  •••'  0ki^  '  where  v(0!>  ..•>  0k)  =  z(sk).  But  the 

modified  cycle  index,  Z(Sg^[S^])  ,  in  which  we  are  interested, 

indicates  the  cycle  character  of  the  elements  of  the  automorphism 

group  with  respect  to  the  corresponding  points  of  Q  as  well.  To 

accomplish  this  it  is  easily  seen  that  we  should  replace  each  term 

in  Z(Sg^)  by  the  product  of  that  term  and  the  expression  obtained 

from  it  by  "substituting"  in  Z(S^_).  This  enables  us  to  obtain 


Z(Sg^[S^])  from  Z(Sg^)  and  ^(Bk)  }  both  of  which  are  known,  in 
principle  at  least,  by  (1.2.5). 


. 

- 

. 
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An  example  may  clarify  this.  Let  gg  =  2  .  The  configuration 
satisfying  this  has  the  following  appearance: 


1  2 

Since  Z(Sg)  =  —  (ti  +  tg)  the  modified  cycle  index  of  the 
automorphism  group  of  this  configuration  is 


5(s2[s2]) 


=  \  {  [g(ef  +  £ a)]2  t\  +  [|(f2  +  £4)]  tg 


=  i  (f4  t2 
6  '  1  1 


+  2f^  fg  t^  +  fg  t^  +  2fg  tg  +  2f^  t2)  , 


as  may  be  seen  by  inspection. 

Applying  the  same  argument  for  each  value  of  k  gives, 
according  to  Polya  [7*0,  P.  177 >  the  result  that  the  modified  cycle 
index  of  the  automorphism  group  of  corresponding  to  the  set  of 

numbers  (g)  is 

m 

(1.10.3)  Z  (h[4|)  =  n  z  (s  gu  [sk])  . 

k=G 

In  the  same  way  the  graph  Gg  ,  containing  n  directed  edges 

one  from  each  point  in  Q  to  some  point  in  P  ,  may  be  characterized 

up  to  an  isomorphism  by  a  set  of  nonnegative  integers, 

b  )  ,  where 
n'  ’ 


(b)  =  (bo,  bv 


. .  - 
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(1.10.4)  bQ  +  bx  +  ...  +  bn  =  m 

and 


(1.10.5)  +  2*b2  +  •  ••  +  n*bn  =  n  • 

We  again  find,  with  obvious  notation,  that 

n 

(1.10.6)  z  (H^j)  =  J]  z  (Sb  [s^])  . 

i= 0  J 

This  would  enable  us  to  count  the  number  of  m  by  n 
functional  directed  bigraphs  the  indegrees  of  whose  points  satisfied 
the  conditions  implied  by  the  values  of  the  numbers  in  (g)  and  (b). 
To  count  all  of  them  we  should  sum  over  all  (g)  and  (b)  satisfying 
(l.lO.l)  and  (1.10.2),  and  (1.10.4)  and  (1.10.5)*  respectively. 
Using  Theorem  1.9.1  this  is  summarized  in  the  following  theorem. 


Theorem  1.10.1. 


m 


w 


ra,n 


n  W  1 

gk  _  b,N-i 


(g),(b)  k=G 


n  u  >  "(flu)  h((])(i)  T(j)(m)i 

i=0  (jfti) 


where  ^(j)(^)  anc*  ^(j)(i)  are  nun,^er  automorphisms  of  type 

(j)  with  respect  to  the  points  of  P  and  of  type  {i)  with  respect 

to  the  points  of  Q  in  and  >  respectively.  The  inner  sum 

is  over  the  same  partition!  as  described  for  (1.5*4). 

00 

If  w(  x,y)  =  ^  w^  n  x111  y11  ,  then  one  finds  using  this 

ro,n=l 


theorem  that 

(1.10.7)  W(x,y) 


2  5 

xy  +  xy  +  xy' 

2  2  2  ^ 
+  px  y  +  7x  y^ 


+ 

4 

xy 

+ 

5 

xy^ 

+ 

+ 

,,  2  4 

llx  y 

+ 

P 

13* V 

+ 

+ 

49x5yJ+ 

+ 

85x?y5 

+ 

+ 

4  4 

159x  y 

+ 

278x2fyp 

+ 

1  .  5  5 

U)(n 
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CHAPTER  II 

ON  BIPARTITE  TOURNAMENTS 


2.1 


Definitions 


A  set  of  n(>  l)  points  such  that  each  pair  of  distinct  points 

is  joined  by  a  single  edge  that  is  oriented  towards  exactly  one  of  the 

points  is  called  a  (round-robin)  tournament .  One  generalization  of 

this  definition  is  as  follows:  A  set  of  n(>  l)  distinct  subsets  of 

points,  P^  =  [P^,  **’>  P£n  )  with  nj  >  1  for  i  =  1,  ••*,  n,  such 

i 

that  every  pair  of  distinct  points  not  belonging  to  the  same  subset  is 
joined  by  an  edge  oriented  towards  exactly  one  of  the  points  is  called 
an  n-partite  tournament  if  there  are  no  edges  joining  points  in  the 
same  subset.  If  the  edge  joining  P^  to  P^  is  oriented  towards 
the  latter  point  this  is  indicated  by  writing  P,^  ->  P ^  and  similarly 
if  the  orientation  is  in  the  opposite  sense.  An  ordinary  tournament  on 
n  points  is  the  same  as  an  n-partite  tournament  with  n^  =  n^  =  • • •  =  n^ 


In  this  chapter  we  investigate  a  few  of  the  properties  of 
n-partite  tournaments  and  extend  some  of  the  results  known  about 
ordinary  tournaments.  The  main  emphasis  is  upon  the  case  n  =  2, 
bipartite  tournaments.  An  m  by  n  bipartite  tournament  may  be 
more  concisely  defined  as  a  complete  oriented  m  by  n  bigraph,  where  a 
complete  graph  of  any  kind  is  one  which  contains  as  many  edges  as 
possible,  consistent  with  its  definition.  Tournaments  have  been 
studied  in  connection  with  the  method  of  paired  comparisons  (see  e.g. 
Kendall  [55]  and  David  [12])  and  a  bipartite  tournament  provides  a 


: 
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model  for  the  comparing  of  each  member  of  one  population  with  each 
member  of  a  second  population  and  making  a  decision,  upon  some 
predetermined  basis,  as  to  which  of  each  pair  is  the  preferred  one. 

To  avoid  repetition  we  remark  at  the  outset  that,  unless 
otherwise  stated,  all  statements  of  a  probabilistic  nature  for  bipartite 
tournaments  are  under  the  assumption  that  all  possible  outcomes  in  a 
given  context  are  equally  likely  and  that  the  orientations  of  the  edges 
incident  upon  one  point  are  independent  of  the  orientations  of  the  edges 
incident  upon  another  point  in  the  same  subset. 

2,2  Conditions  for  an  oriented  graph  to  be  a  bipartite  tournament . 

Consider  an  arbitrary  oriented  graph.  We  seek  conditions  under 
which  the  points  of  the  graph  may  be  assigned  membership  in  one  of  two 
sets,  P  and  Q,  in  such  a  way  that  the  resulting  configuration  may 
be  considered  as  being  a  bipartite  tournament  on  the  point  sets  P 
and  Q.  Clearly  this  property  is  independent  of  the  orientation  of  the 
edges  in  the  graph. 

These  conditions  are  contained  in  the  following  result. 

Theorem  2.2.1.  An  arbitrary  oriented  graph,  G,  with  V  points  and 
E  edges  may  be  considered  as  a  bipartite  tournament  if,  and  only  if, 
the  graph  G'  obtained  from  G  by  ignoring  the  orientations  of  the 
edges  satisfies  the  following  two  conditions: 

(a)  All  cycles  of  G*  are  of  even  length. 

(b)  m  points  of  G'  are  of  degree  V  -  m  and  the 

remaining  V  -  m  points  are  of  degree  m,  where  m  is 

a  nonnegative  integer  such  that  m  <  V  -  m.  (If  this 

is  the  case  then  m  =  \  (V  -  (V  -  '^E)2).  ) 


V  i 
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Any  ordinary  graph  may  be  considered  as  being  a  bigraph  if,  and 

only  if,  its  cycles  are  all  of  even  length,  by  Konig  [57] >  P-  151*  The 

necessity  of  (b)  as  well  is  obvious. 

To  demonstrate  the  sufficiency  of  the  conditions  we  observe  first 
that  (a)  and  Konig' s  theorem  imply  that  the  points  of  G*  may  be  separ¬ 
ated  into  two  mutually  exclusive  and  exhaustive  subsets  of  m  and  V  -  m 
points,  respectively,  where  m  <  V  -  m,  such  that  no  two  points  of  the 
same  subset  are  adjacent. 

Assume  m  >  m.  This  and  (b)  imply  that  all  the  V  -  m  points  of 
degree  m  must  belong  to  the  subset  containing  m  points,  since  other¬ 
wise  there  wouldn't  be  enough  points  notin  the  same  subset  with  them  to 
enable  them  to  have  the  specified  degrees.  But  this  implies  that 
V  -  m  <  in,  or  V  <  m  +  in.  It  further  implies  that  m  <  V  -  m  or 

m  +  in  <  V.  Hence  m  =  V  -  m  >  m  and  m  =  V  -  m  >  in,  or  m  =  m, 

contradicting  the  assumption. 

The  assumption  m  <  m  yields  the  same  contradiction,  so  we  have 
as  the  only  alternative  that  m  =  m  . 

If  m  <  V  -  m  ,  then  none  of  the  points  in  the  subset  with  V  -  m 
points  can  be  of  degree  V-m=V-m>m.  Hence  all  the  points  in 
the  subset  with  m  points  must  be  of  degree  V  -  m  =  V  -  m  and  all  the 
remaining  points  in  the  class  of  V  -  m  =  V  -  m  points  must  be  of  degree 
m  =  m  . 

If  m  =  V  -  m  =  V/2  then  every  point  is  of  degree  V J2.  and 
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since  no  point  can  be  adjacent  to  any  of  the  remaining  V/2  -  1  points 
in  the  same  subset  as  itself  it  must  be  adjacent  to  all  the  V/2  members 
of  the  other  subset. 

In  either  case  restoring  the  original  orientations  to  the  edges 
yields  a  bipartite  tournament  by  definition,  completing  the  proof  of  the 
theorem. 


To  avoid  the  trivial  case  m  =  0  we  need  only  demand  that  the 
original  graph  be  nontrivial,  i.e.  that  it  contain  at  least  one  edge. 

That  there  is  no  such  simple  characterization  of  those  oriented 
graphs  which  may  be  considered  as  n-partite  tournaments  for  n  >  2 
follows  from  the  fact  that  there  is  no  analogue  to  Konig's  theorem  for 
these  cases,  as  cycles  of  any  length  (>  3)  are  possible  in  n-partite 
graphs  for  n  >  2  .  (See  Berge  [3],  p.32). 

The  total  number  of  oriented  graphs  with  m(V  -  m)  edges, 
m  <  V  -  m,  and  V  points  is 


the  first  factor  being  the  number  of  ways  of  selecting  the  edges  and  the 
second  being  the  number  of  ways  of  assigning  the  orientations.  The 
number  of  these  which  may  be  regarded  as  being  a  bipartite  tournament  is 


(2.1.2) 


(  V  )  .  Pm(v  "  m) 

m  ^ 


if  m  <  V  -  m  ,  and  one  half  of  this  if  m  =  V  -  m.  Dividing  (2.1.2) 
by  (2.1.1)  gives  the  probability  that  a  random  oriented  graph  with 
m(V  -  m)  edges  and  V  points  may  be  considered  as  being  a  bipartite 
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tournament . 


Similarly  one  finds  that  the  overall  probability  that  a  random 
oriented  graph  on  V  points  may  be  considered  as  being  a  bipartite 
tournament  is 


V-l 


m=0 


(V)  .  2»(V  -  .) 


9 


if  V  is  odd,  and  the  sum  should  contain  the  additional  term 


( 


2(V/2)2  -1 


when  V  is  even. 


The  factor 


arises 


from  the  fact  that  to  determine  any  oriented  graph  on  V  points  three 

V\ 

choices  must  be  made  for  each  of  the  pairs  of  distinct  points; 

which  way  to  orient  the  edge  joining  the  two  points  if  they  are  to  be 
joined  or  not  to  join  them  at  all. 


2.3  On  the  score  sequence  of  an  n-partite  tournament . 

By  the  score  of  P_,  a  P°int  an  n-partite  tournament  is  meant 

the  outdegree  of  the  point  and  will  be  denoted  by  v^j*  By  the  soore 

sequence  of  an  n-partite  tournament  is  meant  the  sets  of  scores 

V.  =  (  v . . ,  v.  ),  i  =  1,  •••,  n.  With  no  loss  of  generality  we 

1  ll  in. 

1 

may  assume  that  v.,  <  v  <  •••  <  v.  ,  for  all  i.  In  this 

J  ll  —  i2  _  —  m. 

1 

section  we  give  necessary  and  sufficient  conditions  for  n  sets  of 
nonnegative  integers  to  be  the  score  sequence  of  an  n-partite 
tournament . 


These  are  contained  in 

Theorem  2. 3.1.  n(  >  l)  sets  of  nonnegative  integers 


- 


«■-  •  \ 
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V.  =  (v._,  •••,  v.  ),  when  v.,  <  v._  <  •••  <  v.  and  n,  >  1 

1  ll  in.  il  —  i2  —  “in.  i  — 

l  l 

for  i  =  1,  *•*,  n,  form  the  score  sequence  of  some  n-partite  tournament 
if,  and  only  if, 

r» 

n- 1  n 


n  ni 


(a)  I  I  vi: 


i=l  j=l 


i=l  j=i+l 


n .  n . 
i  J 


and 


(b) 


n  ki 


i=l  j=l 


Vii  * 


n- 1  n 


i=l  j=i+l 


k.  k.  , 
i  J 


for  all  sets  of  integers  k^,  i  =  1,  •••,  n,  where  0  <  k^  <  n^  ,  but 


not  all  k,  =  n.  . 

l  l 


It  is  easily  seen  that  (a)  and  (b)  imply  that 

n 


0  <  v. .  < 

“  ij  “ 


s=l 


n  ]  -  n.  ,  for  all  i  and  j 

s  J  i  J 


invo lved . 


That  the  conditions  are  necessary  is  obvious  since  (a)  simply 
requires  that  the  sum  of  all  the  scores  be  equal  to  the  total  number  of 
edges  in  the  configuration  and  (b)  demands  that  the  sum  of  the  scores  of 
the  points  in  any  proper  subset  of  the  tournament  be  at  least  as  large 
as  the  number  of  oriented  edges  joining  points  in  this  subset. 

Before  proceeding  to  the  proof  of  sufficiency  it  will  be  convenient 
to  state  a  few  preliminary  results. 


If  the  points  of  an  n-partite  tournament  can  be  separated  into  two 
mutually  exclusive  and  exhaustive  subsets,  A  and  B,  such  that  every 


•i  <  t 


•; 


-r 


. 
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edge  which  joins  a  point  in  A  to  a  point  in  B  is  oriented  towards 
the  latter  point  then  the  configuration  will  be  said  to  be  reducib le . 

To  avoid  making  exceptions  later  the  trivial  1-partite  tournament  will 
be  considered  reducible  except  when  n^  =  1  .  An  n-partite  tournament 
is  irreducib le  if  it  is  not  reducible. 

Dulmage  and  Mendelsohn  [15]  have  applied  the  term  irreducible 
to  bigraphs  with  a  different  meaning.  However  no  ambiguity  should  arise 
because  the  sense  just  described  is  the  only  one  which  will  be  used  here. 
The  concept  of  irreducibility  of  tournaments  as  defined  here  resembles 
somewhat  the  concept  of  irreducibility  of  a  Markov  chain,  (See  e.g. 

Feller  [26],  p.  349) * 

Lemma  2.3*1.  An  n-partite  tournament  is  irreducible  if,  and  only  if, 
its  score  sequence  satisfies  condition  (b)  with  strict  inequality  hold¬ 
ing  throughout. 

This  follows  immediately  from  the  definitions  of  the  terms  involved. 


An  n-partite  tournament  is  defined  as  being  strongly  connected 


if,  and  only  if,  for  each  ordered  pair  of  distinct  points,  P„  and 

there  exists  an  oriented  path  from  P .  to  Pw,  i.e.  a  sequence 


ij 


k£ 


of  distinct  points  whose  first  element  is  P..,  whose  last  element  is 

ij 

P^,  and  such  that  from  each  point  in  the  sequence  there  issues  an  edge 
oriented  towards  its  successor  in  the  sequence. 


The  following  statement  is  similar  to  a  result  published  by  Roy 


[86]  but  the  argument  is  somewhat  different. 


•  :v.i 


. 


-  - 


Lemma  2. 3* 2.  An  n-partite  tournament  is  strongly  connected  if,  and 
only  if,  it  is  irreducible. 


The  necessity  of  the  condition  is  obvious  and  if  the  tournament 
is  not  strongly  connected  there  are  two  points,  P  and  P^,  such 
that  if  B  is  the  set  of  all  points  which  can  be  reached  from  P _  by 
some  oriented  path  then  P^  is  in  A,  the  set  of  all  points  of  the 
tournament  not  in  B.  Moreover,  any  edge  which  joins  a  point  in  A  to 
a  point  in  B  must  be  oriented  towards  the  latter  point,  which  implies 
reducibility  by  definition. 


Straightforward  considerations  are  sufficient  to  establish  the 
following  result. 


Lemma  2.3.3.  Given  sets  of  integers,  V^,  i  =  1,  ••*,  n,  satisfying 
the  conditions  of  Theorem  2.3*  for  which  there  exist  integers,  h^  , 


0  <  h.  <  n.  ,  for  i  =  1,  •••,  n,  with  not  all  h.  =  n.  ,  such  that 
—  1  —  1  11 

f 

equality  holds  in  (b)  when  =  h^  .  Then  the  sets  V_^  =  (v^,*“,  ) 

i 

and  the  sets  V."  =  (v  ,  -  b.,  ***  v.  -b  . ) ,  where 


ih.  +  l 
1 


in^  x' 


n 


b.  = 
1 


s=l 


-  h.  ,  satisfy  the  conditions  of  Theorem  2.3.1  with 
1 


n^  replaced  by  h^  and  n_^  -  h_^  ,  respectively  for  i  =  1,  ••*,  n  . 


Hence,  if  there  exist  two  n-partite  tournaments  whose  score  sequences 
are  the  V^'’s  and  the  V^M  ’s  ,  respectively,  then  combining  them  in  a 
rather  obvious  fashion  yields  an  n-partite  tournament  whose  score  sequence 
consists  of  the  original  sets,  V^,  i  =  1,  *•*,  n.  The  interpretation  is 
clear  whenever  any  h^  equals  0  or  n^  . 


' 
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The  proof  of  the  sufficiency  part  of  Theorem  2.3*1  now  proceeds 
by  multiple  induction,  first  over  the  number  of  sets  of  points  involved. 

The  case  n  =  1  is  trivial,  so  we  now  assume  that  the  theorem 
has  been  established  for  up  to  n  -  1  sets  of  points,  for  some  n  >  1. 

Suppose  we  have  n  sets,  V.  ,  of  integers  satisfying  the 


conditions  of  Theorem  2.3*1  in  which  v  .  =  v  =  ***  = 

^  nl  n2 


=  v 


nn  =0.  By 
n 


the  induction  hypothesis  and  Lemma  2.30,  with  hl  =  h2  ‘  ”•  =  Vl  =  0 

and  h  =  n  ,  we  may  assert  the  existence  of  an  n-partite  tournament 
n  n 

whose  score  sequence  consists  of  the  * s  . 

For  the  second  induction  assume  the  required  result  has  been 

proved  for  all  cases  where  0  <  v  <  t  -  1  and  consider  n  sets, 

—  nn  — 
n 

V\  ,  of  integers  which  satisfy  the  conditions  of  Theorem  2.3.1  with 

v  =  t  and  with  strict  inequality  holding  throughout  in  (b). 
n 

Our  third  induction  is  now  upon  Z  ,  the  number  of  scores  in 

V  equal  to  t. 
n 


If  Z  =  1  modify  the  given  sets,  V_^,  by  subtracting  one  from 

v  ,  which  is  greater  than  or  equal  to  one  by  hypothesis,  and  by 
n 

adding  one  to,  say,  the  smallest  v_  ,  where  i  <  n,  in  such  a  way 
as  to  leave  all  the  numbers  in  each  set  still  arranged  in  nondecreasing 
order.  That  is,  if,  say,  (v. . ,  v  ,  v  )  =  (0,  0,  l)  ,  one  would  be 

II  LC.  15 

added  to  v\2.  a^d  not  to  v-q  •  This  change  does  not  affect  the  total 
sum  of  the  scores,  so  (a)  is  still  satisfied.  Also,  any  of  the  sums  in 
(b)  are  decreased  by  at  most  one  but  as  strict  inequality  held  throughout 


.  . 

.  • 


.  . 

.  . 


i 

.  . 

•  ^ 
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originally,  by  assumption,  it  follows  that  the  modified  sets  of  numbers 
still  satisfy  (b)  as  well. 


The  maximum  score  in  V  is  now  t  -  1  so  by  the  induction 

n 

hypothesis  for  the  second  induction  there  exists  an  n-partite  tournament 
whose  score  sequence  is  the  set  of  numbers  obtained  by  modifying  the 
V/s.  We  need  now  to  change  this  n-partite  tournament  into  one  whose 
score  sequence  consists  of  the  original  V^’s. 


Let  P  be  the  point  whose  score  was  increased  by  one.  Two  cases 

now  present  themselves.  If  P..-+P  in  the  tournament  whose  scores 
r  lj  nn 

n 

are  the  modified  V.’s  then  simply  reversing  the  orientation  of  the  edge 


joining  P..  and  P  yields  a  configuration  whose  score  sequence  is 
ij  nnn 

the  original  set  of  V.  ’  s.  If,  however,  P  ->  P. .  another  approach 

l  nn  ij 

n  J 

is  required. 


If  the  n-partite  tournament  assured  us  by  the  induction  hypothesis 

is  reducible,  then  P  must  be  in  the  set  called  B  in  the  definition 

nn 


n 

of  reducibility .  This  follows  from  Lemma  2.3. 1  and  the  fact  that  any 

partial  sum  of  the  numbers  in  the  modified  set  of  V^’s  giving  equality 

in  (b)  must  have  included  the  score  of  P  .  P..  must  also  be  in  B 

v  '  nn  11 

n  J 

since  P  ->  P . .  .  But  this  is  impossible,  using  Lemma  2.3. 1  again, 
n 

since  any  partial  sum  in  (b)  of  the  numbers  in  the  modified  set  of  V^’s 

which  includes  the  scores  of  both  P..  and  P  is  unchanged  from  what 

ij  nn 

J  n 

it  was  before  any  of  the  numbers  were  changed;  hence  strict  inequality 

would  hold  and  the  only  alternative,  under  our  assumptions,  is  that  if 

P  -*■?,.  then  the  tournament  is  irreducible, 
nn  n 


. 


.  .  I ' 
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By  Lemma  2.3.2  there  exists  an  oriented  path  from  P . .  to  P  , 

^  lj  nn 

J  n 

The  only  effect  upon  the  scores  caused  by  reversing  the  sense  of  orient¬ 
ation  of  all  edges  in  this  path  will  be  to  decrease  P^’s  by  one  an^ 

to  increase  P  ’ s  by  one. 
nn 

n 

In  either  case  we  have  produced  an  n-partite  tournament  whose 
score  sequence  is  the  original  set  of  V^’s. 

We  now  drop  the  assumption  that  strict  inequality  held  throughout 
in  (b).  What  has  already  been  shown  along  with  the  application  of  Lemma 
2.3.3,  repeated  if  necessary,  completes  the  proof  for  the  case  Z  =  1  . 

Now  assume  that  the  assertion  has  been  demonstrated  for  v  =  t 

nn 

n 

and  up  to  Z  =  r  -  1  >  1  .  We  consider  sets  of  numbers  V^,  i  =  1,  ••*,  n, 

in  which  the  last  r  elements  of  V  are  equal  to  t,  and  which  satisfy 

n 

the  conditions  of  Theorem  2.3. 1  with  strict  inequality  holding  throughout 

in  (b).  The  same  scheme  only  this  time  reducing  v  -(r-i)  by  one 

n  '  ' 

suffices  to  complete  the  proof  of  the  third  induction. 

But  this  completes  also  the  proof  of  the  induction  on  the  maximum 
score  in  the  n  th  set  of  numbers,  which  shows  that  the  theorem  is  valid 
for  n  sets  of  numbers,  which  completes  the  argument  for  the  original 
assertion,  by  induction. 

This  theorem  is  actually  only  a  special  case  of  a  more  general 
result  on  simultaneous  solutions  in  integers  to  set-theoretic  functions 
satisfying  certain  conditions. 

Corollary  2.3. 1.  A  set  of  integers  u^  <  u^  <  • • •  <  u^  is  the  score 


, 


.  . 


.  ■ 


. 
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sequence  of  some  ordinary  round-robin  tournament  if,  and  only  if, 


k 

i=l 

for  k  =  1,  2,  •••,  n,  with  equality  holding  when  k  =  n. 


This  was  first  proved  by  Landau  [6l],  without  using  the  concept 
of  irreducibility ,  and  follows  immediately  from  Theorem  2.3.1  upon  setting 

nl  =  n2  =  •••  =  nn  =  1  . 


Corollary  2.3*2.  There  exists  an  m  by  n  matrix  of  O’s  and  l’s 

with  row  sums  r.,  where  r,  <  r  <  <  r  ,  and  column  sums  c.  , 

l  1  ~  2  —  —  m  j 

where  ^  >  c  >  ...  >  c^,  if,  and  only  if 


for  k  =  0,  1,  ••*,  m  and  £  =  0,  •• 
k  =  m  and  £  =  n. 


c . )  >  ki  , 

J 

n,  with  equality  holding  when 


The  proof  of  this  involves  applying  Theorem  2.3.1  to  assert  the 

equivalence  of  the  hypothesis  and  the  existence  of  an  m  by  n  bipartite 

tournament  with  score  sequence  =  (r^,  ••*,  r^)  and  =  (m-c^,  •**,  m-c^) 

and  constructing  the  required  matrix,  A  =  ||  a„  ||  ,  by  letting 

a.  .  =  1  or  0  according  as  P,  .  — >■  P^ .  or  P^ .  -*P_.  ,  respectively, 
lj  li  2j  2j  li 

We  may  observe  that  in  checking,  for  fixed  k  ,  if 


I 


£ 


j=l 


>  k£  , 


i]  i 

-  * 

■■■  :  ■  » 

t  •  •  •  ^ 
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it  is  necessary  to  consider  only  those  £  for  which  m  -  c^  <  k  ,  for 
if  it  is  satisfied  for  these  values  it  will  automatically  hold  for  the 
remaining  cases.  A  similar  remark  may  be  made  for  fixed  values  of  £  . 

Given  two  sets  of  nonnegative  integers,  r^,  i  =  1,  • • • ,  m,  and 


V 

• 

• 

• 

r— 1 

II 

•I”) 

n,  with 

r.  <  r^  <  • • •  <  r  < 
1  —  2  “  —  m  — 

n  and 

m  >  c^  >  c^  > 

Let 

S .  denote 
J 

the  j  th 

column  sum  of  an  m 

by  n 

matrix  whose 

row 

consists  of 

r  l’s 

followed  by  n  -  r. 

1 

0’s  . 

A  theorem  dis- 

covered  independently  by  Gale  [28]  and  Ryser  [87]  states  that  the  r_^ 
and  0^  can  be  the  row  and  column  sums,  respectively,  of  an  m  by  n 
matrix  of  0’s  and  I’s  if,  and  only  if, 


ci + 


°2  + 


^  S1 


S2  + 


for  j  =  1,  ••*,  n,  with  equality  holding  for  j  =  n. 

Assume  that  a  given  set  of  r_^  and  c^  satisfy  the  conditions 

of  Corollary  2.3.2.  For  any  fiveif  value  of  £  ,  £  =  1,  ••*,  n,  let  k 

be  the  number  such  that  r,  <  £  and  r.  1  >  £.  If  no  such  k  exists 

k  -  k+1 

take  k  =  m  . 


Then 

(ri  +  r2  +  +  rk)  +  ((tn“c1)  +  (m“c2)  +  “•  +  (m-Cg))  >  k£  , 

for  any  suitable  value  of  £  ,  by  Corollary  2.3.2.  Or, 

(c!  +  c2  +  ...  +  c^)  <  (rx  +  r  +  •••  +  rk)  +  i(m  -  k) 

=  +  • • •  +  , 

appealing  to  the  definition  of  S.  and  k,  with  equality  holding  for 


£  =  n  . 


. 

o. 

' 

_ 
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Making  use  of  the  remark  following  Corollary  2.3*2  the  converse 
implication  may  be  demonstrated  in  a  similar  fashion.  Hence  the  Gale- 
Ryser  Theorem  and  Corollary  2.3.2  are  equivalent. 

2.4  The  number  of  bipartite  tournaments . 

The  determination  of  the  number  of  nonisomorphic  m  by  n  bi¬ 
partite  tournaments,  Bm  may  be  considered  as  a  special  case  of  the 
counting  problems  previously  considered.  It  was  described  in  §  1.3  how 
to  obtain  the  counting  series  for  the  number  of  nonisomorphic  oriented 
m  by  n  bigraphs.  The  coefficient  of  the  last  term,  x™1*,  is  clearly 
the  number  of  nonisomorphic  m  by  n  bipartite  tournaments. 


An  alternate  approach  is  to  observe  that  there  is  a  one-to-one 

correspondence  between  m  by  n  bipartite  tournaments  with  t  edges 

oriented  towards  points  in  Q  and  mn  -  t  edges  oriented  towards  points 

in  P  and  ordinary  m  by  n  bigraphs  with  t  edges.  Hence,  the 

total  number  of  nonisomorphic  m  by  n  bipartite  tournaments  is  the  sum 

of  the  coefficients  in  g  (x)  or  g  (l).  Rather  than  evaluate  this 

m,n  m,n 

counting  series  and  then  substitute  in  x  =  1  it  is  simpler  to  let  x  =  1 
when  first  substituting  the  figure  counting  series  into  the  cycle  index. 
This  gives  the  result  that 


(2  U  1)  B  =  Z(G  ;  2)  = 
m,n  m.n 


m,n 


m!n! 


,(m)  h(n) 

l(j)  V) 


aEW*b> 


where  the  summation  is  over  the  same  partitions  and  indices  as  described 
for  (I.3.4). 


00 


The 


first  few  terms  of  B(x,y)  =  ^>  B 


m,n=0 


m,n 


m  n 
x  y 


are  found  to 


.  . 

•  •  .  ■ .  i  lint 


<  i 


. 


- 
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be  as  follows: 


(2.4.2) 


The  form  of  P6lya's  theorem  used  in  (2.4.1)  which  counts  the  total 
number  of  inequivalent  configurations  regardless  of  their  content  was 
found  independently  by  Davis  [I3]  and  Slepian  [91].  For  a  further  discuss¬ 
ion  of  the  relationship  between  this  form  and  Pdlya's  original  version  see 
Harary  [39]. 

The  largest  term  in  (2.4.1),  corresponding  to  (j)  =  (m)  and 
(i)  =  (n),  gives  the  inequality  t 


(2.4.3)  B. 


m,  n 


It  seems  likely  that  for  large  m  and  n  and  with  some  restriction 
upon  their  relative  orders  the  term  retained  to  give  this  inequality 
represents  an  increasingly  larger  proportion  of  all  m  by  n  bipartite 
tournaments.  A  better  inequality,  resulting  from  considering  also  the 
partitions  (j)  =  (m-2,l)  and  {&)  =  (n-2,l)  for  m,  n  >  2,  is 


( 'f “  +  (")  2 


If  the  remaining  terms  are  to  become  negligible  in  comparison  with 
the  first  term  then  certainly  the  logarithm  of  m  must  become  negligible 


' 
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in  comparison  with  n,  and  vice  versa,  as  m  and  n  become  large. 

For  m  =  n  =  5  these  two  bounds  are  approximately  .42  and  .76, 
respectively,  of 

For  similar  remarks  with  respect  to  ordinary  tournaments  see 
Landau  [59]  and  Davis  [14],  For  more  general  approximations  and  refer¬ 
ences  to  other  relevant  papers  see  Harary  [4l]. 

It  should  be  observed  that  determining  the  number  of  nonisomorphic 
m  by  n  bipartite  tournaments  is  not  equivalent  to  determining  the 
number  of  different  score  sequences  for  such  tournaments,  for  while  having 
the  same  score  sequence  is  obviously  a  necessary  condition  for  two  bi¬ 
partite  tournaments  to  be  isomorphic  it  is  not  sufficient  as  may  be  seen 
by  the  example  to  be  given  in  §  2.12.  The  same  applies  to  ordinary 
tournaments,  as  pointed  out  by  Landau  [59]-  The  determination  of  the 
number  of  score  sequences  appears  to  be  unsolved,  although  Mohanty  and 
Narayana  [64]  claim  to  have  some  upper  bounds  for  the  number  of  score 
sequences  of  ordinary  tournaments  and  David  [12]  contains  tables  and 
frequencies  of  the  score  sequences  of  tournaments  involving  fewer  than 
nine  points,  among  other  things. 

An  obvious,  albeit  virtually  useless,  scheme  for  generating  all 
the  score  sequences  of  m  by  n  bipartite  tournaments  is  by  the 
exponents  of  the  quantities  in  the  terms  of  the  expansion  of  the  follow¬ 


,m 

>n 


ing  product: 


s  •  -  -  - 


* 

L 
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•  2.5  The  number  of  irreduc ible  bipartite  tournaments . 

By  determining  all  the  value!  of  k  and  &  for  which  equality 

holds  in  the  sum  of  Corollary  2.3.2  it  is  easily  seen  how  every  m  by 

n  bipartite  tournament  may  be  formed  from  irreducible  m^  by  n^ 

bipartite  tournaments,  U  n  ,  where  v  =  1,  2,  •••,  h,  h  a  positive 

v  v 

integer,  m^  +  m^  +  ...  +  m^  =  m,  and  n^  +  no  +  • • •  +  n^  =  n  ,  by 

letting  each  P  point  and  Q  point  in  U  be  joined  by  an  edge 

v  v 

to  each  Q  point  and  P  point,  respectively,  in  U  ,  •••,  U  , 

in  i  n  —  m.  n« 

v+1  v+1  n  h 

for  v  =  1,  2,  •**,  h  -  1,  which  is  oriented  towards  the  latter  point. 

Furthermore,  starting  with  a  given  m  by  n  bipartite  tournament  the 

graphs  U  v  =  1,  ••*,  h,  called  the  irreducible  bipartite  sub- 

v  v  ’ 

tournaments  of  the  original  one,  are  uniquely  determined  up  to  an  iso¬ 
morphism.  (If  the  original  graph  is  irreducible  then  h  =  1  and 

U  is  the  graph  itself).  The  converse  statement  is  valid  also. 

mini 

This  observation  enables  us  to  count  the  number  of  nonisomorphic 
irreducible  bipartite  tournaments  in  terms  of  the  total  number  of  non¬ 
isomorphic  bipartite  tournaments. 

Let  b  and  A  denote  the  number  of  nonisomorphic  m 

m,n  m,n 

by  n  bipartite  tournaments  which  are  irreducible  and  reducible, 
respectively,  and  let  their  corresponding  generating  functions  be 

b(x,y)  and  A(x,y).  We  recall  that  1  =  ^1  0  =  ^  an<^  ^0  i  =  ^i  0  =  ^  ’ 

for  i  =  2,  3,  • • • ,  from  the  definition  of  irreducibility  given  in 

§  2.3.  Then 


(2.5.1)  b(x,y)  +  A(x,y)  =  B(x,y) 


y 


. 
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where  B(x,y)  is  as  defined  in  the  preceeding  section,  since  every 
bipartite  tournament  is  either  reducible  or  irreducible,  but  not  both. 
Also 

(2.52)  B(x,y)  =  b(x,y)  +  b2(x,y)  +  b5(x,y)  +  •••  =  y^^fx'y)  ’ 

since  the  k  th  power  of  b(x,y)  ,  for  k  =  1,  2,  •••,  enumerates  those 
bipartite  tournaments  which  may  be  considered  as  having  been  formed  by 
combining  k  irreducible  bipartite  tournaments  according  to  the  scheme 
described  above. 


Solving  for  b(x,y)  gives 


(2.5.3)  b(x,y) 


1  +  B(x,y) 


which  could  have  been  derived  directly  by  use  of  the  principle  of  inclusion 
and  exclusion. 


From  (2.51)  and  (2.53)  we  have 


(2.5.4)  A(x,y) 


1  +  B(x,y)  =  b(x’y)  B(x'y) 


or  equivalently 


(2.5.5)  A(x,y) 


fa2(x,y) 

1  -  b(x,y) 


It  is  easily  seen  that  these  could  have  been  derived  in  other 


orders . 


By  restricting  the  number  of  terms  that  appear  in  b(x,y)  it  is 
not  difficult  to  count  bipartite  tournaments  in  terms  of  other  parameters, 
e.g.  in  terms  of  the  size  of  the  largest  irreducible  subtournament  they 
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contain. 

Similar  considerations  suffice  to  count  the  number  of  nonisomorphic 
irreducible  ordinary  tournaments  in  terms  of  the  total  number  of  ordinary 
tournaments,  which  may  be  regarded  as  known,  from  Davis  [14]  and  Harary 
[40] .  Counting  ordinary  tournaments  in  terms  of  the  largest  irreducible 
subtournament  contained  is  the  same  as  counting  them  in  terms  of  the 
longest  oriented  cycle  they  contain  since  for  ordinary  tournaments  irre- 
ducibility  is  equivalent  to  the  existence  of  an  oriented  cycle  through 
all  the  points  from  results  of  Camion  [6]  and  Roy  [86]. 

In  a  paper  on  weighted  compositions  of  integers  Moser  and  Whitney 
[68]  have  considered  certain  other  generating  functions  related  by  equa¬ 
tions  similar  to  (2.5.2)  and  (2.5.5)* 

2.6  The  probability  that  a  random  bipartite  tournament  is  irreducible. 

It  has  been  shown  elsewhere  [65]  that  if  Q(n)  denotes  the  prob¬ 
ability  that  an  ordinary  round-robin  tournament  on  n  points  chosen  at 

(g) 

random  from  the  set  of  2  such  tournaments  ijtse  reducible,  then 

(2.6.1)  |  Q(n)  -  ^n-1  |  <  ~rPT  ,  for  n  >  8. 

In  this  section  we  extend  the  methods  used  to  obtain  this  to  derive 

information  about  P(m,n),  the  probability  that  an  m  by  n  bipartite 

mn 

tournament  chosen  at  random  from  the  set  of  2  such  tournament  is 
irreducible.  P(l,0)  =  P(0,l)  =  1  and  P(i,o)  =  P(0,i)  =  0,  for 
i  =  2,  3,  •••,  by  definition. 


If  a  given  bipartite  tournament  is  reducible  then  it  contains  a 
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1  °'.: 
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maximal,  in  a  sense,  proper  subset  of  points  such  that  these  points 
together  with  the  edges  joining  them  form  an  irreducible  bipartite 
subtournament  and  all  edges  joining  points  not  in  this  subset  with 
points  which  are  in  it  are  oriented  towards  the  latter  points.  This 
is  a  consequence  of  the  considerations  in  the  first  paragraph  of  the 
preceeding  section  where  this  subgraph  was  denoted  by  U  n 

"V  h 

The  probability  that  this  subgraph  contains  k  P  points  and 
l,  Q  points,  l<k<m,  l<i<n,  but  not  both  k  =  m  and  i  =  n,  is 


(”)  <")  P(k,i)  2  M  2  (-*)("-*)/  2 

=  o  <")  p(k,i) 

2k(n-^)  2^(m-k)  ’ 

k  f, 

since  P(k,-g)  2  irreducible  k  by  i  tournaments  may 

each  of  the  (™)  (^)  subsets  of  k  P  points  and  S,  Q 
having  chosen  one  of  these  there  remain  only  (m-k)(n-i) 
sense  of  orientation  needs  to  be  chosen. 


be  formed  on 
points,  and 
edges  whose 


The  only  other  alternative  for  reducible  m  by  n  tournaments, 

m,  n  >  1,  is  that  there  exists  a  subset  of  points,  of  necessity  all 

belonging  to  the  same  set,  P  or  Q,  whose  scores  are  zero.  In  this 

case  U  would  not  necessarily  be  uniquely  determined,  although  it 

mhnh 

would  be  determined  up  to  an  isomorphism.  The  probability  of  this  being 


1  -  (1  -  l/2n)  m 


+ 


1  -  (1  -  l/2m)  n 


the  case  is 
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Combining  these  cases  and  summing  over  k  and  i  in  the  range 
described  above  gives  the  result  that 


(2.6.2)  P(m,n)  =  1 


1  -  (1  -  l/2n)m 


1  -  (1  -  l/2m)n 


-  Y  0  0  P(k^} 

— '  2k(n--0  2^(m-k) 

Since  P(k,i)  =0  if  k  or  £  equals  one  we  need  sum  only  over 

k  =  2,  .  ..,  m  and  St,  =  2,  •  •  •  ,  n,  but  not  both  k  =  m  and  &  =  n. 

Using  this  the  first  few  nontrivial  values  of  P(m,n)  are  found 
to  be  approximately  as  follows: 

(2.6.3) 

P(2,2)  =  .12500  , 

p(3,2)  =  .09375  ,  p(3»3)  =  -19921  , 

P(4,2)  =  .05462  ,  P(4,3)  =  .22140,  P(4,4)  =  .23624  , 

P(5,2)  =  .02921  ,  P(5>3)  =  .19867,  P(5,4)  =  .43096  ,  P(5,5)  =-76173 

As  the  computational  labor  involved  in  using  (2.6.2)  very  soon  , 
becomes  prohibitive  we  seek  some  estimates  for  the  value  of  P(m,n). 

We  make  the  further  restriction  that  m  =  n. 

The  following  inequality  is  easily  established. 


Lemma 

2.6. 1. 

wP  <  (m  - 

5) 

22(m-l) 

,  for  integers  m  >  4 

Lemma 

2.6.2. 

xsP  <  (m  - 

5) 

(m  -  t) 

in-  1+t  .  .  , 

2  ,  for  integers  m  >  4 

and  t  =  2,  3,  • • •  ,  m  "  2 
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When  m  =  4  it  may  be  verified  directly.  We  next  show  that  the 
inequality  is  valid  for  t  =  2  and  all  admissible  values  of  m  . 

Assume  that 

k^  <  (k  -  3) (k  -  2)  2^  +  ^  for  some  integer  k  >  4  . 

y 

Using  the  induction  hypothesis  and  obvious  inequalities  gives 


(k  +  1)^  =  k^  +  3k2  +  3k  +  1 

<  (k-j)(k-2)2k+1  + 


f  (^)(k-2)2k+1 


+  |  (k-3)(^)2k+1 


3(^)(^)2k+1 


+  1 


<  (k-3)(k-2)2k+1  +  (k-2)2k+i  +  (k-3)2k+E  + 


(k-3)(k-2)2k+1  +  2k+2 


(k  +  1)  -  3 


(k  +  1)-  2 


_k+2 


The  proof  of  the  lemma  will  be  completed  if  we  can  now  show  that 
the  right  hand  side  is  an  increasing  function  of  t,  for  2  <  t  <  m  -  2 

This  is  equivalent  to  showing  that  the  derivative  with  respect  to 
t  of  the  right  hand  side  is  positive  in  this  range,  or  that 


(m  -  3) 


„  m-l+t  ,  v  m-l+t  . 

-2  +  (m  -  t)  2  log  2 


>  0 


This  will  be  so  if 


log  2  ’ 


m  -  t  > 


+ 


v  !  >. 


-V  -I- 


v  4 


.  *.  / 
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+ 
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*  / 
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or 

1 

t  <  m  -  - - -  , 

log  2 

which  is  certainly  the  case  from  the  restrictions  on  t.  This  completes 
the  proof  of  the  lemma. 

z. 

We  now  extend  this  bound  for  nr  to  include  another  variable. 

-z  \+Z 

Lemma  2.6.3  nr  <  (m  -  3)(m  "  j)(m  -  Z>)  2J  ,  for  integers  m  >  4  , 
and  j ,  =  2,  . . . ,  m  -  1  . 


The  inequality  is  easily  verified  when  m  =  4  and  all  admissible 
j  and  Z  .  Assume  that  it  has  been  demonstrated  for  all  values  of  m 
up  to  k  >  4,  and  all  admissible  values  of  j  and  Z  for  each  such 
values  of  m.  Then  for  fixed  j  and  Z  between  2  and  k  -  1  it 
follows  that 


(k+l) 


9  =  k5  +  3k ^  +  3k  +  1 


<  (k-3)(k-j)(k-£)2J+i  + 


+  (k-3)  (^)  (k  -  Z)  +  (k-3)(k-j)(i:~) 


,kliN  (k-j)(k-i) 
v  k  ; 


,k -Z- 


J+i 


(k-3)  +  (^2)(k-j)(!^)  + 


j*.  1 


< 


(k-3)(k-j)(k-J)  +  (k-j)(k-i)  +  (k-3)(k-i) 


+  (k-3) (k- j )  +  (k-3)  +  (k-j)  +  (k-i>l 


J+i 


(k+1)  -  3 


(k+1)  -  j 


(k+1)  -  Z 


J+& 


The  only  values  of  j  and  Z  ,  in  the  case  m  =  k  +  1,  not 
included  in  this  inductive  argument  are  when  one  of  j  or  Z  but  not 


*  , 

'  ~r  r 
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both  equals  k,  in  which  case  the  result  follows  from  Lemma  2.6.2,  or 
when  j  =  i  =  k,  in  which  case  the  result  follows  from  Lemma  2.6.1.  This 
completes  the  proof  of  the  lemma  by  induction. 


Lemma  2.6.4 


(”)  (") 


m 


(m  -  l)(m  -  2) 


for  integers  m  >  16,  and  j,  i  =  2,  . . . ,  m,  but  not  j  =  i  =  m 


Straightforward,  but  tedious,  calculations  suffice  to  show  that 
the  inequality  is  valid  for  m  =  16  and  all  admissible  j  and  i  . 

Assume  that  it  has  been  verified  for  all  values  of  m  up  to 
k  >  16  and  all  admissible  values  of  j  and  l  for  each  value  of  m. 
Let  j  and  &  be  fixed  between  2  and  k  but  not  with  both  equalling 
k .  Then 


(k  *  b  (k  \  6 

2j(k+i-i)2^(k+i-j) 


_ (k  +  Dg _ 

( k+l-j)(k+l-£)2T+ 


(j)  0 


2j(k-i)2i(k-iy 


< 


(k  +  if 

(k+l-j)(k+l-i)2J+ 


1 

k(k- l) (k-2) 


< 


(k  +  if 

(k+1 f  k(k-l) 


_ 1 _ 

=  (k+l)[(k+l)-l][(k+l)-2] 


using  the  induction  hypothesis  and  Lemma  2.6.3* 


The  remaining  values  that  j  and  l  may  assume,  when  m  =  k  +  1, 
for  which  the  validity  of  the  lemma  is  not  established  in  this  argument 


are  as  follows: 
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(i)  j  =  £  =  k. 


Verifying  the  inequality  in  this  case  is  equivalent  to  showing 
5  2k 

that  (k+l)  (k)(k-l)  <  2  ,  for  k  >  16,  which  is  easily  seen  to  be 

valid  for  k  >  8. 

(ii)  j  =  k  +  1  and  l  =  2,  k  . 


To  show  the  correctness  of  the  lemma  when  j  =  k  +  1  and  St  =  k 
is  equivalent  to  showing  that  (k+l)^(k) (k-l)  <  2k+^  ,  for  k  >  16, 
which  also  may  be  done  in  a  straightforward  manner. 


Since 


(k+1) 

li+l; 


fk+1)  [(k+l)-(jB+l)  ] 


2k+1(k+l->g)  . 
i  +  1 


(k+1) 

^  l  1 


fk+l)(k+l-^) 


for  i  =  2, . . . ,k-l  , 


and 


2k+1  (k+l-l) 

H  +  1 


>  1 


for  k  >  16  , 


it  follows  that  in  this  case  the  left  member  of  the  inequality  is  an 
increasing  function  of  l  for  fixed  k.  Hence  its  maximum  over  the 
range  under  consideration  occurs  at  St,  -  k  where  the  validity  of  the 
inequality  has  already  been  established. 


Appealing  to  symmetry  to  treat  the  remaining  cases  completes  the 
proof  of  the  lemma  by  induction. 

Since  P(k ,£)  <  1  and  the  sum  in  (2.6.2)  contains  m(m-2)  terms 
Lemma  2.6. k  may  be  used  to  give  the  following  result: 

Theorem  2.6.1  2(l-l/2m)m  -  1  -  ^  C  P(m,m)  <  2(l-l/2m)m  -  1, 


for  m  >  16. 
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This  can  be  used  to  derive  an  asymptotic  estimate  for  P(m,m) 
and,  in  particular,  shows  that  P(m,m)  tends  to  one  as  m  tends  to 
infinity. 

In  the  next  section  a  partial  answer  will  be  given  to  the  question 

of  for  what  relative  orders  of  m  and  n  is  it  still  true  that  P(m,n) 

tends  to  one  as  m  and  n  tend  to  infinity.  That  this  is  not  always 

1  n 

the  case  is  easily  seen.  For  an  extreme  example  let  m  >  log  (  —  )  2  , 
where  0  <  €  <  1  ,  as  m  and  n  tend  to  infinity.  Then 

P(m,n)  <  (1  -  l/2n)m  +  (1  -  l/2m)n  -  1 

m 

/.  ,  ,  n.m  ~  p11  log  e 

<(1-1/2)  <  e  ^  <  e  °  =  €  . 

Moser  [67]  has  shown  that  if  the  maximum  score  of  any  point  in 
an  ordinary  tournament  on  n  points  is  less  than  or  equal  to  3/4(n-l) 
and  if  the  minimum  score  is  greater  than  or  equal  to  l/4(n-l)  then  the 
tournament  is  irreducible. 

For  bipartite  tournaments  there  is  the  following  similar  result: 
Theorem  2.6.2.  If  the  scores  of  the  P  and  Q  points  of  an  m  by  n 
bipartite  tournament  all  lie  strictly  between  n/4  and  Jn/4,  and  rtl/k 
and  3m/4  ,  respectively  then  the  tournament  is  irreducible. 

The  proof  will  be  by  contradiction.  If  the  bipartite  tournament 
is  reducible  then  its  points  may  be  separated  into  two  mutually  exclusive 
and  exhaustive  classes,  A  and  B,  such  that  every  edge  which  joins  a 
point  in  A  to  a  point  in  B  is  oriented  towards  the  latter  point.  Let 
B  contain  k  P  points  and  l  Q  points,  and  A  contain  m  -  k  P 
points  and  n  -  l  Q  points  where  k  and  i,  considering  nontrivial 
cases  only,  are  integers  such  that  1  <  k  <  m  -  1  and  1  <  l  <  n  -  1  . 
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.  H*h,or 

If  k  <  m/b  ,  k  >  Jjm/b  y/( £  >  Jn/b  then  it  is  easily  seen  that 
some  score  violates  the  hypothesis  of  the  theorem.  If  both  k  <  m/2 
and  £  <  n/2  then  the  minimum  score  is  certainly  less  than  or  equal  to 
k/2  <  m/4  ,  if  it  belongs  to  a  Q  point,  or  £/2  <  n/b  ,  if  it  belongs 
to  a  P  point,  contradicting  the  hypothesis  of  the  theorem  in  either 
case.  A  similar  contradiction  results  from  considering  the  maximum 
score  when  both  k  >  m/2  and  £  >  n/2  . 

By  symmetry  there  remains  to  consider  only  the  case  where,  say, 
m/2  <  k  <  y&/b  and  n/4  <  £  <  n/2  ,  but  not  both  k  =  m/2  and  £  =  n/2 

Assuming  that  the  scores  satisfy  the  hypothesis  of  the  theorem  it 
follows  that  the  total  number  of  edges  joining  two  points  in  A  is 
strictly  less  than 

{yn/k  -  k)(n  -  £)  +  (3n/l+  -  i)(m  -  k)  . 

If  this  quantity  can  be  shown  to  be  less  than  or  equal  to 
(m  -  k)(n  -  £)  ,  the  actual  number  of  edges  joining  two  points  in  A  , 
a  contradiction  will  have  been  derived. 

Similarly,  still  assuming  that  the  scores  satisfy  the  hypothesis 
of  the  theorem,  the  total  number  of  edges  joining  two  points  in  B  is 
strictly  greater  than 

kn/4  +  £m/b 

If  this  quantity  can  be  shown  to  be  greater  than  or  equal  to  k£ 
the  actual  number  of  edges  joining  two  points  in  B  ,  another  contra¬ 


diction  will  have  been  derived. 
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The  inequalities  which  we  would  like  to  verify  under  the  condi¬ 
tions  on  k  and  £  become,  upon  simplifying  and  rearranging: 

(2.6.4)  n(m/2  -  Jk/if.)  <  £(3 m/4  -  k)  , 

and 

(2.6.5)  m/4  >  k  (£  -  n/4) 

Replacing  £  by  n/4  in  (2.6.4)  the  resulting  inequality  is 
seen  to  be  valid  when  k  >  5^/8  ;  hence  if  n/4  <  £  <  n/2  and 
5m/8  <  k  <  3m/4  (2.6.4)  holds,  implying  a  contradiction.  We  need 

now  consider  only  the  case  where  m/2  <  k  <  5/8  m  . 

Replacing  k  by  5/8m  in  (2.6.5)  the  resulting  inequality  is 
seen  to  be  valid  when  £  <  5n/l2  ;  hence  if  n/4  <  £  <  5n/l2  and 
m/2  <  k  <  5m/8  (2.6.5)  holds,  implying  a  contradiction.  We  need  now 

consider  only  the  case  where  5n/l2  <  £  <  n/2  . 

The  new  lower  bound  for  £  is  now  substituted  into  (2.6.4)  again 
and  the  resulting  inequality  is  valid  for  k  >  9 m/l6  ,  so  henceforth  we 
may  assume  that  m/2  <  k  <  9m/l6.  Using  this  bound  for  k  in  (2.6.5) 
we  find  that  the  resulting  inequality  is  now  valid  for  £  <  9n/20  ,  so 
henceforth  we  may  assume  that  ^n/20  <  £  <  n/2  . 


By  induction  it  follows  that  after  the  t  th  iteration  of  this 


process  (2.6.4)  will  have  been  established  for  n/4  <  £  <  n/2  and 
4t  +  1 


8t 


m 


m 


<  k  <  Jm/4  and  (2.6.5)  will  have  been  established  for 

4t  +  1 


1/2  <  k  <  Jm/4  and  n/4  <  £  <  ^  ^  •  n  ,  which  will  suffice  to  prove 

the  theorem  for  these  cases. 


For  fixed  m  and  n  only  a  finite  number  of  such  iterations  are 
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necessary  to  bring  the  above  bounds  on  k  and  l  ,  for  which  the 
theorem  is  valid, to  within,  say,  a  distance  of  one-half  from  m/2  and 
n/2  ,  respectively.  Having  already  disposed  of  the  case  where  k  =  m/2 
and  l  =  n/2  this  suffices  to  complete  the  proof  of  the  original  assert¬ 
ion  since  k  and  l  are  integers. 

2.7  On  the  diameter  of  a  bipartite  tournament . 

If  a  and  b  are  two  distinct  points  in  an  oriented  graph  then 

by  the  distance  from  a  to  b  is  meant  the  length  of  the  shortest 
oriented  path  from  a  to  b  ,  where  the  length  of  a  path  is  the  number 

of  edges  it  contains.  If  no  such  path  exists  we  shall  say  that  the  dis¬ 

tance  is  infinite.  The  distance  from  a  to  b  is  not  necessarily  the 
same  as  the  distance  from  b  to  a  under  this  definition.  By  the 
diameter  of  an  oriented  graph  is  meant  the  maximum  value  of  the  distance 
between  any  two  distinct  points  of  the  graph.  From  Lemma  2.3.2  it  follows 
that  the  diameter  of  a  reducible  n-partite  tournament  is  infinite. 

For  some  results  on  the  diameter  of  ordinary  directed  graphs  see 
Ghouila-Houri  [29]  . 

Moser  [67]  has  shown  that  the  probability  that  an  ordinary  tourn¬ 
ament  on  n  points  has  diameter  two  tends  to  one  as  n  tends  to  infinity 
furthermore,  if  k  r  o(n/log  n)  ,  as  k  and  n  tend  to  infinity,  then 
a  random  tournament  on  n  points  will  have  more  than  k  paths  of  length 
two  joining  every  pair  of  distinct  points  with  probability  tending  to  one. 
Nevertheless,  there  exist  tournaments  with  finite  diameters  as  large  as 
n  -  1  ,  for  n  =  3  ,  4  ,  . . .  .  In  this  section  some  related  results  are 
derived  for  bipartite  tournaments. 


Since  some  of  the  statements  of  the  type  we  would  like  to  prove 
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for  m  by  n  bipartite  tournaments  are  not  necessarily  true  when  there 
is  an  extreme  disparity  in  the  relative  magnitudes  of  m  and  n  we 
shall  make  the  restriction  for  the  remainder  of  this  section  that  m  and 
n  tend  to  infinity  in  such  a  way  that 

(2.7.1)  log  m  =  o(n)  and  log  n  =  o(m) 

In  the  first  place  it  is  easily  seen  that  the  probability  of  there 
being  any  point  of  indegree  zero  in  a  random  m  by  n  bipartite  tourna¬ 
ment  is  less  than  or  equal  to  m/2n  +  ,  which  tends  to  zero  as  m 

and  n  tend  to  infinity  while  satisfying  (2.7.1) ♦ 


In  the  second  place  the  probability  that  in  a  random  m  by  n 

tournament  there  exists  two  distinct  points  in  P,  P.  and  P.  ,  such 

1  J 

that  there  is  no  path  of  length  two  from  P^  to  P^.,  i.e.  no  point 

Q  such  that  P.  Q  and  Q  -»  P .  ,  is  at  most 
Hj  i  Jo  & 


,  ,  x  n  mn.-2n 

m(m-l)  5  2 _ 

mn 


m(m-l)  (j/4)n  , 


since  there  are  m(m-l)  ways  of  choosing  the  ordered  pair  of  P  points, 
three  admissible  ways  of  orienting  the  two  edges  joining  these  two  points 
to  each  of  the  n  Q  points,  and  two  ways  of  orienting  each  of  the  re¬ 
maining  mn-2n  edges.  But  this  quantity  also  tends  to  zero  as  m  and  n 
tend  to  infinity  while  satisfying  (2.7. l).  This  and  a  corresponding 
argument  with  the  two  P  points  replaced  by  two  Q  points  implies  that 
the  distance  betwen  any  two  distinct  points  both  of  which  are  in  the  same 
point  set  of  a  bipartite  tournament  is  two,  with  probability  tending  to 


one . 
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Since  it  is  impossible  for  a  bipartite  tournament  to  have  diameter 
equal  to  one  or  two,  the  observations  in  the  two  preceeding  paragraphs 
imply  the  following  theorem. 

Theorem  2.7.1.  The  probability  that  an  m  by  n  bipartite  tournament 
has  diameter  three  tends  to  one  as  m  and  n  tend  to  infinity  while 
satisfying  (2.7. l). 

Corollary  2.7. 1.  The  probability  that  an  m  by  n  bipartite  tournament 
is  irreducible  tends  to  one  as  m  and  n  tend  to  infinity  while  satis¬ 
fying  (2.7.1). 

This  provides  a  partial  answer  to  the  question  mentioned  in  the 
preceeding  section  following  Theorem  3.6. 1,  and  is  an  immediate  consequence 
of  Theorem  2.7. 1  and  the  earlier  remark  that  the  diameter  of  a  reducible 
tournament  is  infinite. 

The  probability  that  there  are  not  more  than  k  paths  of  length 
two  joining  every  ordered  pair  of  P  points,  P_^  and  P^  ,  is  at  most 

m(m-l)  /  n  v  .  k  n-k  m  n-2  n  ,  ,x/nw_/,  \n-k 

■  gW-  (  k  )  If  3  2  =  m(m-l)  (  k  )(  3/A) 

by  a  slight  extension  of  the  previous  considerations.  It  is  not  difficult 
to  see  that  this  tends  to  zero  as  m  and  n  tend  to  infinity  while 
satisfying  (2.7. l)  if  k  =  o(n/log  n).  A  corresponding  result  applies 
when  the  two  P  points  are  replaced  by  two  Q  points,  providing  a 
property  of  bipartite  tournaments  similar  to  that  of  ordinary  tournaments 


mentioned  above. 
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Deeper  considerations  would  presumably  enable  one  to  make  stronger 
statements  concerning  the  number  of  paths  of  length  three  which  join  two 
points  not  in  the  same  point  set  of  a  bipartite  tournament. 


2.8  A  hierarchy  index  for  bipartite  tournaments . 

In  studying  the  structure  of  animal  societies  in  which  a  non¬ 
transitive  dominance  relation  exists  between  every  pair  of  distinct 
members,  such  as  the  peck-right  in  a  flock  of  hens,  Landau  [59] >  [6o] 
has  considered  what  he  called  a  hierarchy  index  which  provides  a  measure 
of  how  closely  the  society  resembles  a  hierarchy  with  respect  to  the 
dominance  relation.  A  hierarchy  with  respect  to  a  given  dominance  relation 
is  a  society  whose  members  can  be  so  ordered  that  each  member  dominates 
all  those  who  are  beneath  him  in  the  ordering  and  is  dominated  by  all 
those  who  are  above  him  in  the  ordering.  An  ordinary  tournament  provides 
a  model  of  such  a  structure  and  in  this  section  we  consider  an  analogue 
of  Landau's  hierarchy  index  for  bipartite  tournaments  although  a  natural 
interpretation  for  the  situation  here  is  perhaps  not  as  readily  available 
as  that  for  ordinary  tournaments. 


In  a  nontrivial  m  by  n  bipartite  tournament  in  which  the  scores 

of  the  points  P.  and  Q.  are  denoted  by  v_^  and  u.  ,  respectively, 

i  =  1,  . ..,  m  ,  j  =  1,  ...,  n  ,  the  hierarchy  index  ,  h  ,  is  defined  by 

m  n 


(2.8.1) 


h 


k 

m  n  (m+n) 


This  index  ranges  from  zero,  when  all  v_^  = 
which  can  happen  only  if  both  m  and  n  are  even, 


n/2  and  all 
and  one,  when 


u.  =  m/2 
J  / 

either 
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all  of  the  edges  are  oriented  towards  points  of  Q  or  they  are  all 

oriented  towards  points  of  P  .  This  situation  corresponds  to  a 

4 

hierarchy  in  the  ordinary  case.  The  factor  - - \  is  chosen  so 

m  n  (,m+n ) 

as  to  make  the  index  have  the  value  one  in  the  latter  case. 


We  shall  determine  the  mean  and  variance  of  h  under  the 
assumptions  given  in  §  2.1  . 


Denoting  the  mean  and  variance  of  a  random  variable,  y  ,  by 


E(y)  and  cr  (y)  ,  respectively,  it  follows  that 


m 


E(h)  = 


m  n  (m+n) 


i=l 


(v.-n /2)‘ 


•  £  ■ 

j=l 


(u_.-m/2)' 


4 


m  n  (m+n) 


■|  m  E  (VjL  -  n/2)‘ 


+  n  E 


(u£  -  m/2)' 


since  the  expectation  of  a  sum  equals  the  sum  of  the  expectations  of  the 

terms  in  the  sum  and  the  quantities  in  the  same  sum  in  the  above  expression 

.2 


are  equal.  But  E 


(v!  "  n/2) 


is  simply  the  variance  of  the  sum  of 


n  independent  random  variables  which  have  the  same  binomial  distribution 

2 


with  mean  l/2  ,  and  is  therefore  n/k.  Similarly  E  (u^  -  m/2)‘ 
Substituting  these  into  the  above  equation  gives 


=  m/k. 


(2.8.2)  E  (h)  =  2/(m  +  n)  . 

The  determination  of  the  variance  is  somewhat  more  involved  Qn£i 
may  proceed  as  follows: 

a2(h)  =  E  (h2)  -  E2(h) 


-  7^  - 


12 


mn(m+n) 


E 


JE,  JL  p-, 

.(£  (vn/2)2  +  I(ur”/a)2) 

i=l  j=l 


-  E2(h) 


12 


mn(nH-n) 


jtn  E[(v1-n/2)4]  +  id(id-I)  E  [(v1-n/2)2(v2-n/2)2] 


+  2mn  E[(v^-n/2)2  (u^-m/2)2] 

+  n(n-l)  E[(u1-tn/2)2(u2-m/2)2J  +  n  E[(u1-m/2)lf']| 

-  E2(h)  , 

upon  expanding  the  first  expression,  taking  expectations,  and  combining 
those  terms  which  are  equal. 


E[(v^-n/2)^]  =  n(3n-2)/l6.  (See  e.g.  Kendall  and  Stuart  [56], 


p.  121.) 


E[(v^-n/2)2(Vg-n/2)2]  =  E2[(v^-n/2)2]  =  n2/l6  ,  since  the 
orientations  of  the  edges  incident  upon  are  independent  of  the 

orientations  of  the  edges  incident  upon  P2> 

Finally,  E[(v1-n/2)2  (u^m/2)2]  =  E[(v1-n/2)2]  •  E[  (u.^m/2)2]  =  mn/lo. 
One  way  of  establishing  this  is  to  replace  v^  and  u^  by  v.  +  5^  and 
ul  +  ,  respectively,  where  6^  is  one  or  zero  according  as  to  whether 

P^  ->  Q1  or  -* P^  ,  respectively,  and  =  1  -  5^  .  v^  represents 
the  score  of  P^  with  respect  to  the  points  Q^,  . ..,  Qq  and  a 
corresponding  remark  applies  to  u^,  The  statement  follows  upon 
expanding  the  resulting  expression,  observing  that  v^  and  u^  are 
independent  of  each  other,  that  E(&x)  =  E(S2)  =  l/2  and  that  E(&x  =  0  , 
and  making  use  of  known  values  of  the  moments  of  the  binomial  distribution. 
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Substituting  these  values  and  those  obtained  by  symmetry  into 
2 

the  expression  for  cr  (h)  and  simplifying  one  has  the  following  result: 

(2.8.J)  cr2(h)  =  . 

(m+n)  mn 

By  Chebyshev's  inequality  it  follows  that  as  m  and  n  tend  to 
infinity  the  probability  that  h  differs  very  much  from  zero  tends  to 
zero. 


More  general  situations  in  which  the  probabilities  that  P_^  — 
and  Q^.  — *  P_^  are  denoted  by  p  and  q  ,  respectively,  where 

0  <  Pij  <  1  and  Pij  +  qji  =  1  >  f°r  1  =  l>  •  ••>  m  ^  3  =  l>  ••  •>  n  > 

may  be  considered.  In  this  case  the  mean  and  variance  of  h  may  be 

calculated,  in  principle  at  least,  by  using  characteristic  functions. 

Other  quantities  could  be  used  to  define  a  hierarchy  index  but 
that  in  (2.6.1)  appears  to  be  as  simple  as  any.  For  example,  the  m/2 
and  n/2  appearing  in  the  sums  of  (2.8.1)  could  both  be  replaced  by 

mn/(m+n)  and  the  factor  in  front  modified  so  that  the  resulting 

expression  still  varies  between  zero  and  one.  However  the  mean  and 
variance  now  become  quite  complicated  functions  of  m  and  n  and  in 
particular  don't  necessarily  tend  to  zero  as  m  and  n  tend  to  infinity. 
Further  results  about  the  distribution  of  h  will  arise  from  a  somewhat 
different  context  in  the  next  section. 

We  conclude  this  section  with  a  simple  example  of  the  distribution 
of  h  when  ro  =  3  and  n  =  2  under  the  hypothesis  of  randomness 
mentioned  earlier.  Since  h  is  a  function  of  the  scores  the  various 
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score  sequences,  the  probability  that  a  random  3^2  bipartite 
tournament  has  such  a  score  sequence,  and  the  corresponding  values  of 
h  are  listed,  from  which  the  expected  value  and  variance  of  h  may  be 
calculated  for  this  case. 


Distribution  of  h  for  m  =  3  and  n  =  2 


v  =  (vx,- 

v2’v^’  u  =  (ur  1 

u^)  2°* Probability  of  V  and 

U  lp  h 

(2,2,2); 

(0,0) 

1 

15 

(1,2,2); 

(0,1) 

6 

9 

(0,2,2); 

(1,1) 

* 

7 

1 

(1,1,2); 

(1,1) 

6 

3 

(1,1,2); 

(0,2) 

6 

7 

(0,1,2); 

(1,2) 

12 

5 

✓ 

(1,1,1); 

(0,3) 

2 

9 

(1,1,1); 

(1,2)  > 

S' 

0 

1 

(0,1,1); 

(1,3) 

6 

7 

(0,1,1); 

(2,2) 

6 

3 

(0,0,2); 

(2,2) 

3 

7 

(0,0,1); 

(2,3) 

6 

9 

(0,0,0); 

(3,3) 

1 

15 

E(h)  = 

2/5 

II 

x: 

1/25 

2.9  On  a  measure  associated  with  a  bipartite  tournament 

In  this  section  we 

consider  another  quantity  which  provides,  in 

a  sense, 

a  measure  of  how 

nearly  alike  the  various  points  of 

a  bipartite 

tournament  are  with  respect  to  the  orientations  of  the  edges  incident 
upon  them.  This  quantity  will  admit  of  a  perhaps  more  intuitive 
interpretation  than  did  the  hierarchy  index  but  we  will  find  that  the 


two  are  very  closely  related 
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It  will  be  convenient  to  introduce  first  the  definition  of  the 
adjacency  matrix  of  a  bipartite  tournament,  essentially  given  already 
in  the  proof  of  Corollary  2.3.2. 


Given  an  m  by  n  tournament  whose  point  sets  are 
P  =  ^Pl>  P2J  •••>  Pm^  and  ^  =  ^1'  •••>  ^  '  by  the  adjacency 

matrix  of  this  tournament  is  meant  the  m  by  n  matrix,  A  =  |ja_||  , 

where  a.,  is  one  or  zero  according  as  to  whether  P.  -> Q .  or 
ij  i  J 

-*  P_^  ,  respectively.  Later  we  shall  need  the  n  by  m  matrix. 


C  =  Ilcj^||  >  where  c^  is  one  or  zero  according  as  to  whether  — >  P^ 

or  P^  -> Q  ,  respectively.  Clearly,  A  plus  the  transpose  of  C 
equals  the  m  by  n  matrix  all  of  whose  entries  are  ones. 


The  binary  distance,  b^  ,  between  the  distinct  points  P^ 
and  P ^  of  a  bipartite  tournament  is  defined  as  follows? 


n 


(2.9.1) 


ki 


a.  •  “  a  n- 

1  kj 


j=l 


where  A  =  ||a_||  is  the  adjacency  matrix  of  the  tournament  as 
described  above,  b  .  is  simply  the  number  of  points  Q.  such  that 

Ki/  1 

the  edges  joining  P.  and  P.  to  Q.  have  different  orientations. 

K  i/  1 

The  term  "binary  distance"  is  used  because  the  definition  of  b^  is 
essentially  the  same  as  the  definition  of  the  distance  between  two 
binary  words  in  information  theory.  (See  e.g.  Peterson  [73];  P.  7.) 


g  ,  the  binary  distance  between  the  distinct  points  Q  and 

S  L  S 


Q  ,  is  similarly  defined  by 
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IB 


(2.9.2)  8st  =  ^  I 


IS 


ait 


i=l 


In  an  id  by  n  tournament  let 
distance  between  any  two  distinct  P 


b  denote  the  minimum  binary 
points. 


The  number  of  m  by  n  bipartite  tournaments  having  b  =  0  ,  or 

equivalently,  the  number  of  m  by  n  matrices  of  l's  and  0's  having 

,  .  ,  /BK  _n  _n(m-2) 

at  least  two  rows  identical,  is  at  most  {g )  2  2  ,  since  for 

each  of  the  (™)  choices  of  two  distinct  rows  there  are  2°  ways  of 

causing  them  to  be  identical  and  there  remain  n(ro-2)  entries  to  be 

chosen  either  zero  or  one.  The  ratio  of  this  number  to  2mn  ,  the  total 

number  of  m  by  n  tournaments  is  less  than  ro  /2  which  tends  to 

zero  provided  that  m  and  n  satisfy  (2.7.1)  as  they  tend  to  infinity. 


More  generally,  the  number  of  m  by  n  bipartite  tournaments 
having  b  <  k  is  less  than  or  equal  to  the  number  of  m  by  n  adjacency 
matrices  having  two  rows  whose  entries  differ  in  at  most  k  places 
which  is  at  roost  (™)  2^  (^)  2^  2n^m  ^  ,  since  having  chosen  the  two 
rows  in  one  of  (™)  ways  and  filling  out  the  entries  of  one  of  them 
in  2n  ways  we  may  then  choose  in  one  of  (^)  ways  the  k  columns  in 
which  the  other  row  may  possibly  differ,  the  remaining  n-k  entries 
being  determined  by  the  corresponding  entries  in  the  row  already  filled 
out.  The  entries  in  these  k  positions  in  the  second  row  may  be 

chosen  in  one  of  2^  ways  and  there  still  remain  n(m-2)  entries 

which  may  be  zero  or  one.  The  ratio  of  this  number  to  the  total 

number  of  m  by  n  tournaments  is  less  than  m^  n^/2n+'*'  \  As  m  and  n 
tend  to  infinity  while  satisfying  (2.7.1)  it  is  not  difficult  to  see 
that  this  quotient  tends  to  zero  if  k  =  o(n/log  n).  These  observations 
suffice  to  prove  the  following  result! 


:  -  ■  ; 

n  -  ■ 


-  •  .  •  : 

- 

_  ,  1 

.  ..  "! 

-  »  . 


. ' 


. 
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Theorem  2.9.1.  If  k  =  o(n/log  n)  the  probability  that  the  minimum 
binary  distance  between  two  distinct  P  points  of  an  m  by  n  bipartite 
tournament  is  less  than  or  equal  to  k  tends  to  zero  as  m  and  n  tend 
to  infinity  while  satisfying  (2.7.1). 

By  symmetry  similar  results  hold  with  respect  to  the  minimum 
binary  distance  between  two  distinct  Q  points.  Also,  the  same  type 
of  argument  may  be  used  to  derive  corresponding  results  on  the  maximum 
binary  distance  between  two  points. 

A  quantity,  D  ,  which  provides  a  measure  of  the  degree  of 
similarity  subsisting  between  the  points  of  an  m  by  n  bipartite 
tournament  may  be  defined  as  follows! 


(2.9.5) 


where  the  sums  are  over  all  unordered  pairs  of  integers  between  1 
and  m  and  1  and  n  ,  respectively.  An  immediate  consequence  of 
the  definitions  of  the  terms  involved  is  that  D  =  0  if,  and  only  if, 
all  the  entries  of  the  incidence  matrix  associated  with  the  bipartite 
tournament  are  the  same. 


As  before  let  v.  and  u.  denote  the  scores  of  points  P.  and 

i  j  i 

,  respectively,  in  an  m  by  n  tournament,  for  i  =  1,  . ..,  m  and 
j  =  1,  . ..,  n.  Then  the  ith  row  sum  and  the  jth  column  sum  of  the 


incidence  matrix  of  such  a  tournament  are  v^  and  m“uj  )  respectively. 
Hence,  the  ith  row,  containing  v.  l7s  and  n-v.  O’ s  ,  contributes 
v^(n-v^)  to  2  g  and  similarly  the  jth  column  contributes  u ^ (m-u^ ) 

to  2  (v  Summing  over  the  rows  and  columns  gives  an  alternat  ive 

K  l 


expression  for  D 


»  • 

' 

' 

. 

.  - 

. 

.  .. 

' 

. 

a...  ;i  . 
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(2.9.4) 


D 


TT) 


i=l 


) 


u  .  (ro-u  .  ) 

y  y 


In  this  form  it  is  easily  seen  that  the  maximum  value  D  may 
assume  occurs  when  the  v^  and  u^  are  all  as  nearly  equal  to  n/2 
and  m/2  ,  respectively,  as  is  possible.  Straightforward  considerations 
for  the  various  cases  imply  the  following  results! 


Theorem  2.9.2. 


m  n 


(m+n) 


ron-1 


(m+n) 


D  <  U 


m  n  /  \  m 

"Tj7  (ro+n^  "  4  > 
—  (m+n'  "  5  ' 


if  m  =  n  5  0(2)  ; 

if  m  =  n  =  l(2)  * 

if  m  =  0(2)  and  n  =  l(2)  * 

if  m  =  l(2)  and  n  =  0(2)  . 


Rather  obvious  examples  can  be  constructed  for  which  equality 
holds  for  each  possibility. 


Expanding  the  right  member  of 

ro  n. 

(m+n)  h  =  2^  (y^-n/2)2  +  ^  (uj"m/2)2 

i=l  j=l 

and  comparing  it  with  (2.9.4)  shows  that 


(2.9.5)  (m+n)  h  +  D  =  Sjp  (m+n)  . 


.  ■ 

’ 


’ 

= 
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In  other  words  the  hierarchy  index,  as  defined  in  (2.8.1), 
apart  from  a  constant  factor  for  fixed  values  of  m  and  n  ,  and  D  , 
as  defined  in  (2.9.3)  >  measure  the  same  thing  except  for  a  difference 
in  origin,  as  it  were.  Hence,  the  results  of  §2„8  can  be  adapted  to 
apply  to  D  and,  conversely,  what  we  shall  presently  show  about  the 
distribution  of  D  can  be  modified  to  give  corresponding  results 
about  the  distribution  of  h  in  perhaps  an  easier  manner  than  it 
would  have  been  to  have  derived  them  for  h  directly. 

Since  the  expected  values  of  each  of  the  terms  in  the  sums  in 
(2.9.3)  are  n/2  and  m/2  ,  respectively,  it  follows  that 

(2.9.6)  E(D)  =  HLH  (m+n-2 )  , 
under  the  hypothesis  of  randomness. 

The  variance  of  D  may  be  calculated  by  methods  similar  to 
those  which  will  be  used  to  calculate  the  third  moment  about  the  mean 
or  it  may  be  obtained  almost  directly  from  (2.8.3)  and  (2.9.5)« 
Either  approach  gives 

(2.9.7)  cr2(D)  =  (m+n-2)  . 


In  obtaining  an  approximation  to  the  distribution  of  D  under 
the  given  assumptions  we  shall  require  the  third  moment,  p  (d)  ,  of 

3 

D  about  its  mean.  First  we  obtain  an  expression  for  the  expected 
* 

value  of  D^. 


Raising  both  sides  of  (2.9.3)  to  the  third  power,  taking 


expectations,  and  combining  terms  of  the  same  type  gives 


r  - .  ..  . 


.  j 


: 


.  ..  : 

. 

-  ....  '  :  >  .  . 


.  - 


■  .  .  ..  •  .  :  . 


c  J  .. 


S-  ...  ..  .  . 

~  ■  -  ......  ;  . 
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(2.9.6)  E(D3)  =  (”)  E(b32)  +  3  (2)(m22)  E(b2£  b^) 


+  3ra(m-l)(m-2)  E(b2£  b^)  +  3  (g)(2)  E(b2g  g^) 

*  Qr*)C?)  E(b12  b56)  +  3 Q)C*)Q  E(b12  bj4  g12) 


+  6m  (",21)(m23)  E(b12  b13  b^)  +  6m  (“j1)  E(blg  b^  b^) 


+  oQ)(m- 2)(in-3)  E(b12  b ^  b^)  +  6m  E(b12  bl*  S12^ 


+  6(,)  E(b  0  b  b  )  +  (terms  obtained  by  interchanging 
^  l£-  e-j  lj5 


m  and  n  and  b. .  and  g. 

ij  ij' 


The  expected  values  of  the  various  terms  are  calculated  separately 
using  the  independence  of  various  variables  whenever  possible. 

,  *  2 

(i)  E(b^2)  =  ^  (n+3)  ,  from  Kendall  and  Stuart  [56] ,  p.  58. 

(There  appears  to  be  a  misprint  in  the  second  equation  from  the  bottom 
of  the  page  in  the  last  reference.  It  reads 

p'  =  n(n-l) (n-2)  p'  +  3n(n-l)  p^  -  2np  , 

but  from  the  process  of  derivation  it  is  easily  seen  that  the  last  -2 
should  be  a  +1  which  gives  the  result  needed  here. ) 

(ii)  E(b22  b^)  =  E(b22)  E(b,4)  =  (£  +  \)  (§)  =  \  (n+1)  , 

using  properties  of  the  binomial  distribution. 

2 

(iii)  E(b^2  b^_.)  =  E(b^2)  Efb^)  =  ^  (n+l)  ,  since  if  the  first 

row  of  the  adjacency  matrix  is  thought  of  as  having  been  filled  in  the 
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expression  becomes  a  function  of  the  entries  in  the  second  and  third 
rows  which  are  independent  of  each  other. 


a,,,  ,  and  a  ,  in  the  adjacency  matrix  of  a  bipartite 

11  12  21  22 

tournament,  may  be  all  the  same.  If  this  is  not  the  case  it  may  be  that 
entries  in  the  same  row  are  alike,  that  entries  in  the  same  column  are 
alike,  that  diagonally  opposite  entries  are  alike,  or  that  three  of  the 
entries  are  of  one  kind  and  the  fourth  is  of  the  other  kind.  The  corr¬ 
esponding  probabilities  of  these  events  are  found  to  be  1/8,  1/8,  1/8, 

2 

1/8,  and  1/2,  respectively.  Calculating  the  expected  value  of  b ^  g^ 
for  the  various  cases  gives  the  result  that 


(iv) 


That  is, 
obvious . 


2  -v  1  rm-2  /m-2N21  ,n-2s 

b12  S12  "8^4  +  (  2  ^  ^  2 

1  r  r-m-2  ,m-2N21  , ,m-2N  ,  •>  ,n-2x 

+|([—  +  (— )  J  +  M— )  +  M  (— ) 


1  rm~2  /.m-2x21/n-2  1  rrm-2  ,m-2\2n 

+  8  +  ](~  +  2)  +  A  +  ] 


8lU 


,  ,m-2\  .  i  ,n-2 

+  M— )  +  +  2) 


1  rr m-2  /m-2\ 2-,  _,m-2v  ii/n’2  mn  (n+l) 

+  g  U—  +  (— )  ]  +  2(— )  +  1}(—  +  1)  =  — 


E(b^2  g12)  =  E(b^)  E(g12^  ’  which  may  not  be  intuitively 


(v)  E(b12  b34  V  ■  E(V  E(V  -  4  • 

2 

(vi)  E(b12  §12^  =  E(bi2^  E( gi2^  =  ~T  »  from  eitber 


the  type  of  procedure  used  in  (iv)  or  that  to  be  illustrated  for  (xi). 


\  \ 


;  i  r  ) 


+ 


;s  -  '  ; 


:  c  + : ' 


+ 


•)  + 


+  V  + 


)  L 


(  )  + 


L 


;  -i- ) 


(  + 


:c  +  ; 


+ 1  \ 


(,  )■(•)(  .* ) 
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(vii) 

E^b 12  b13  b45^ 

=  E(blg) 

E(b13 

( viii) 

E<b12  b13  V 

-  E(b12> 

E(b13 

(ix) 

E<b12  b13  V 

-  E(b12) 

E(bi3 

(x) 

E^b12  b13  S12^ 

"  E(b12> 

E(bi3 

n' 


V 

’lV 

il2) 


n 


n 


8 

3 

8 

3 

8 


2 

n  m 

8 


Statements  (vii)  -  (x)  follow  by  reasoning  similar  to  that 
employed  in  establishing  (iii)  and  (vi)  . 

To  obtain  the  expected  value  of  the  remaining  term  we  observe 
that  we  may  write 


b12  =  ^12  +  Ci2^  +  +  ti2^  ’  where 


t(  j) 
12 


3lj  ■  a2j  I  ’  f°r  j  =  •—  n  ’ 


and  similarly  for  the  other  two  factors.  Upon  substituting  these  express¬ 
ions  into  the  product,  b  b  b.  ,  expanding,  taking  expectations,  and 

12  25  13 

combining  similar  terms,  we  have  that 

E(b12b23b13)  =  n(n-l)(r.-2)  E(  t [b) ) 


*  3n(n-l)  E(ti2)t23)t13)) 

+  n  E(t12  c23  C13  >  • 

But  Eft^t^t^))  =  E(t^)  E(t^)  E(t^)  =  i  ,  since  these 

are  independent,  involving,  as  they  do,  different  columns.  Also, 
E(t^t^t^)  =  E(t^)  >  by  independence.  Considerir 


'  N  '  \  ' 


^  I  v  '  ) 


v  ' 
\ 


■  \  )  \  /  \  \  r  \ 

)  V  •  /  + 


(■  ■  )  (;  1  ).  r  ' 
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the  various  combinations  of  values  which  occur  in  the  first  product 

shows  that  the  expected  value  of  the  complete  product  is  l/k  •  1/2. 

The  consideration  of  these  same  possibilities  shows  that 

E(t^^  tj^  =  0.  Substituting  these  expected  values  gives 

id  dj  i^> 

(xi)  E(b12  b23  b13)  =  §  (n2-l). 

Substituting  the  values  in  (i)  -  (xi)  plus  those  obtained  by 
symmetry  into  (2.9.8)  gives  an  expression  for  E(D'  ). 

But  p_(D)  =  E(D5)  -  5E(D)  o-2(D)  -  E:(D).  Substituting  (2.9.6), 
✓ 

(2.9.7)>  and  (2.9.8)  into  this  equation  and  simplifying  the  expression 
thus  obtained  gives  the  following  result! 


(2.9.9) 


^*(D)  =  “TT”  f  (m-l)(m-2)  +  (n-l)(n-2)]  . 


From  equations  (2.9.5)  and  the  one  preceding  we  see  that  D  is 

related  to  something  having  the  appearance  of  a  variable  which  would  have 

2 

the  X  -  distribution  were  all  the  terms  involved  statistically  independent, 

which  they  aren't.  Nevertheless  we  can  transform  D  into  a  variable 

2 

whose  first  three  moments  are  those  of  a  X  -  distribution. 


The  first  three  moments  of  the  X  -  distribution, 


-  2 

d  Face-2  X  X  dX  , 


are  given  by  p'  =  v  ,  p0  =  2v  ,  and  p_  =  8v.  (See  e.g.  Kendall  and 
1  2  5 

Stuart  [56],  p.  570). 


,  r  ran  (m+n-2 )  _n 

(2.9.10)  x  =  k  [  -jj-  v  ’  -  D]  +  v 


has  mean  v  also. 


.  - 


- 


. 


. 


t 
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Since  p0(Ax  +  B)  =  A  po(x)  for  fixed  A  and  B  we  roust  have 


k  =  (2V)1  (2g  (m+n-2 )  )"2 


in  order  for  the  variance  of  x  to  be  equal  to  2v.  We  must  also  have 


[ (m-1) (tn-2)  +  (n-l)(n-2)]  gv 

= 


[3|  (oH-n-2  )  ] 


(2v )' 


or 


v  = 


ron  (m+n-2 )' 


[(m-l)(m-2)  +  (n-l)(n-2)]‘ 


since  ti_(Ax  +  B)  =  A 0  H-.(  x)  for  fixed  A  and  B, 

3  3 


Substituting  into  (2.9.10)  implies  that 


(2.9.11) 


_ 4  (ro+n-2 ) _  r  ron  (m+n-2) 

X  ~  (m- 1 ) (rn-2 )  +  (n-l)(n-2)  L  4 


D]  +  v 


has  its  first  three  moments  the  same  as  a  X,  -  distribution  with  v 
degrees  of  freedom. 


As  an  example  we  consider  the  case  m  =  n  =  4.  By  direct 

1(5 

enumeration  it  may  be  seen  that,  barring  errors,  3042  of  the  2 
4  by  4  bipartite  tournaments  give  a  value  of  D  less  than  or  equal  to 
16.  Denoting  the  probability  of  the  occurrence  of  the  event  A  by 
Pr(A)  we  have,  approximately,  that  Pr(D  <  lo)  =  0.046  in  this  case. 

(2.9.11)  becomes  x  =  2[24  -  D]  +  24  and  from  tables  of  the 

2 

X  -  distribution  with  24  degrees  of  freedom  we  find,  approximately, 


that  Pr (x  >  40)  =  0.022. 


r 


v  . 


j 


7 


I 
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It  is  not  difficult  to  see  that  D  ^  17  for  any  4  by  4 
tournament.  Hence,  the  accuracy  of  the  approximation  for  this  case 
may  be  improved  by  replacing  D  by  D+l  ,  where  the  1  represents 
the  so-called  correction  for  continuity.  (For  a  discussion  of  this 
see  Kendall  [55] >  P.  42  and  p.  155>  where  the  same  procedure  as  used 
here  is  carried  out  for  a  different  problem  for  ordinary  tournaments. ) 
Doing  this  the  probability  that  D  is  less  than  or  equal  to  16  should 
be  near  to  the  probability  that  x  ,  as  defined  above,  is  greater  than 
or  equal  to  $8 .  From  tables  this  latter  probability  is  found  to  be 
approximately  0. 038  which  is  somewhat  nearer  to  the  actual  value, 

0.046. 


Presumably  the  agreement  would  be  better  for  larger  values  of 
m  and  n. 

2.10  The  number  of  acyclic  bipartite  tournaments 

A  tournament  which  contains  no  oriented  cycles  is  said  to  be 
acyclic .  Up  to  an  isomorphism  there  is  only  one  ordinary  tournament 
on  n  points  which  is  acyclic.  This  is  the  graph  called  a  hierarchy 
in  §2.8.  If  the  points  of  the  tournament  are  labelled  then  clearly 
there  are  n.’  ways  of  ranking  them  so  as  to  form  a  hierarchy.  The 
counting  of  the  number  of  acyclic  bipartite  tournaments  is  not  quite 
as  trivial  and  will  be  carried  out  in  this  section.  Both  the  labelled 
and  unlabelled  cases  will  be  treated. 

If  in  a  bipartite  tournament  all  of  the  points  had  an  outdegree 
greater  than  zero  one  could  go  indefinitely  from  point  to  point  along 
an  oriented  path.  Since  the  graph  contains  only  a  finite  number  of 
points  this  path  would  eventually  pass  through  the  same  point  twice, 


. 

. 


, 
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thus  forming  a  cycle.  Hence,  every  acyclic  bipartite  tournament 
contains  some  points  of  outdegree  zero  which,  of  necessity,  must  all 
be  in  the  same  point  set,  say,  P.  With  this  assumption  a  moment’s 
reflection  suffices  to  establish  the  validity  of  the  following  assertion, 
since  the  removal  of  points  of  outdegree  zero  and  the  edges  incident 
upon  them  from  an  acyclic  bipartite  tournament  leaves  an  acyclic  graph 
remaining.  In  every  acyclic  m  by  n  tournament,  in  which  the  points 
of  outdegree  zero  are  points  of  P  ,  the  points  of  P  and  Q  may  be 
split  into  k  and  i  mutually  exclusive  and  exhaustive  non-vacuous 
classes,  where  the  ith  and  jth  class  contain  r^  and  s^  points, 
respectively, 


Jc  j : 

(2.10.1)  k-i  =  1  or  0  ,  ^  r.  =  m  ,  ^  s .  =  n  , 

j=l 


i=l 


and  such  that  the  edges  joining  the  points  in  the  ith  class  of  P 
points  to  the  points  in  the  first  i-1  classes  of  Q  points, 
i  =  2,  ...,  k  ,  are  oriented  towards  the  points  in  Q  and  all  the 
remaining  edges  are  oriented  towards  points  in  P.  Furthermore, 
each  such  acyclic  bipartite  tournament  determines  uniquely  the  r^'s 
and  sj’s  >  order,  and  the  converse  as  well  is  true  up  to  an 
isomorphism.  Therefore,  the  number  of  nonisoroorphic  acyclic  m  by  n 
tournaments  in  which  the  points  of  outdegree  zero  are  points  of  P 
is  the  number  of  ordered  sets  of  strictly  positive  integers  satisfying 
(2.10.1).  The  remaining  case  may  be  treated  by  symmetry. 

If  k=i=r,  r  =  1,  2,  ...,  min[m,n]  ,  the  number  of 
solutions  to  (2.10.1)  is  the  product  of  the  number  of  compositions 


... 

. 
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of  m  and  n  ,  respectively,  into  r  parts,  or  (see  Riordan  [84], 


P.  124) 


If  k-1  =  i  =  r  ,  r  =  1,  2,  . ..,  min[m-l,n]  ,  the  number  of 
solutions  to  (2.10.1)  is  the  product  of  the  number  of  compositions 
into  r+1  and  r  parts  of  m  and  n  ,  respectively,  or 

r;1)  (":];) . 

Upon  interchanging  m  and  n  to  treat  the  case  where  the  points 
of  outdegree  zero  are  Q  points  and  summing  over  r  we  have  the 
following  result,  where  ^  denotes  the  number  of  nonisomorphic 
acyclic  m  by  n  tournaments. 

Theorem  2.10.1. 


min[m,n] 


min[m-l,n] 


r=l 


r=l 


min[m,n-l] 


+ 


r=l 


Corollary  2. 10.1. 


[rt (m-1 ) ] 


-2  02ro+l 


Setting  m  =  n  in  Theorem  2.10.1  gives 


. 


■  • 


. 


•  • 
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ro-l 

2{I  c:i)[0  +  0i  +  ci)2} 

r=l 

ro  m 

= 2 1  Ci>  O  <  2 1  o2 

r=l  r=0 

t  \~2  _2m+l 

~  \it  m)  2  , 

using  well  known  binomial  identities  and  Stirling’s  formula.  (See 
e.g.  Feller  [26],  pp.  62-63. )  The  other  half  of  the  corollary  may  be 
shown  in  a  similar  fashion. 

Next  we  determine  L  ,  the  number  of  acyclic  m  by  n 

m,n  3  J  J 

tournaments  with  labelled  points.  The  number  of  such  tournaments,  in 
which  the  points  of  outdegree  zero  are  P  points,  and  in  which 
k  =  Z  =  r  ,  r  =  1,  2,  ...,  roin[m,n]  ,  where  k  and  i  have  the 
same  meaning  as  before,  is  the  product  of  the  number  of  ways  of 
putting  m  and  n  different  things  into  r  different  cells, 
respectively,  with  no  cell  empty,  or  (see  Riordan  [84],  p.  91) 

r.'  S(ro,r)  r]  S(n,r)  , 

where  S(m,j)  denotes  a  Stirling  number  of  the  second  kind  defined  by 

m 

S(tn,  j)  ^  ,  m  >  0  . 
j=° 

Analogous  results  for  the  other  cases  are  obtained  in  a  similar 


=  2(2m) 
'  nr 


K 

m,m 


fashion  and  combining  them  gives 


;  . 

p 

.  . 

. 

-  91  - 


Theorem  2.10.2. 

min[ro,n] 

L^n  =2  ^  (r.'  )2  S(ro,r)  S(n,r) 

r=l 

min[m-l,n] 

+  ^  (r+l)I  r.'  S(ro,r+l)  S(n,r) 

r=l 

min[m,n-l] 

+  ^  (r+l)J  r l  S(m,r)  S(n,r+l) 

r=l 


Corollary  2.10.2. 


m 


Lm,ro  =  2  2.,  ^r*  ^  S^1’  r+1)  S(ffl>r)  • 


r=l 


The  proof  of  this  is  similar  to  part  of  that  of  Corollary  2.10.1 
except  that  the  identity  (see  Riordan  [64] ,  p.  33)  used  here  is 


S(m,r)  +  (r+1 )  S(ro,r+l)  =  S(m+1,  r+l)  . 

2. 11  A  condition  on  the  score  sequence  of  a  bipartite  tournament  which 
implies  the  existence  of  cycles 

An  ordinary  tournament  on  n  points  is  acyclic  if,  and  only  if, 
its  score  sequence  consists  of  the  numbers  0,  1,  ...,  n-1  ,  or 
equivalently,  if,  and  only  if,  the  sum  of  the  squares  of  the  scores  of 
its  points  is  S  (n-l)(2n-l).  The  latter  formulation  was  a  problem  on 
the  19>8  Putnam  Examination  [5].  In  this  section  we  derive  a  somewhat 
similar  appearing  condition  on  the  score  sequence  of  a  bipartite 
tournament  which  may  indicate  whether  or  not  it  is  acyclic.  The  price 
of  maintaining  simplicity,  however,  is  a  loss  of  generality  as  the 
condition  to  be  derived  can  only  show,  sometimes,  when  a  bipartite 


tournament  contains  cycles 


.  ■  . 

. 

*  *  n 


• 

-  92  - 


We  have  already  seen,  in  §2.10,  that  if  the  scores  of  the  points 
of  a  bipartite  tournament  are  all  strictly  positive  then  the  graph 
contains  at  least  one  oriented  cycle.  Dulmage  and  Mendelsohn  [16] 
have  given  a  related  condition  for  ordinary  unoriented  bigraphs. 


Theorem  2.11,1.  Let  there  be  given  an  m  by  n  bipartite 

tournament,  m  >  n  ,  whose  score  sequence,  arranged  in  non-decreasing 
order,  is  V  =  (v^  v ...,  v^)  and  U  =  (u^  u£,  ...,  un).  If  it 
is  acyclic  then 


S2  (m,n) 


m  n 


^  n(2n  +  1)  .  ,  ^  2 

>  — — -  +  (m-n;n  . 


For  an  arbitrary  value  of  m  the  proof  will  be  by  limited 
induction  upon  n. 


The  statement  is  vacuously  true  for  n  =  C. 

If  n  =  1  there  are  no  cycles  possible  and  Sc(m, l)  =  (m-u^)  »1 
2  2 

+  1  *  u^  >  m  ,  since  u^  >  u^  ,  which  verifies  the  theorem  for  this 
case. 

Assume  the  theorem  has  been  proved,  for  an  arbitrary  value  of 
m  ,  for  n=0,  1,  ...,  k<m,  and  consider  any  acyclic  m  by  (k+l) 
tournament.  Then  u^  ^  =  m-t  ,  where  t  is  the  number  of  P  points 
whose  score  is  k+l  ,  from  the  discussion  of  the  structure  of  acyclic 
bipartite  tournaments  in  general  and  the  hypothesis  that  the  scores 
were  in  non-decreasing  order.  The  removal  of  the  point  and  the 

edges  incident  upon  it  leaves  an  acyclic  m  by  k  tournament  to  which 


. 

•  :f  ■ 


.  .  . 

. 
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the  induction  hypothesis  may  be  applied  to  imply  that 

S2(m,k+l)-(m-t)2  -  t[(k+l)2-k2]  >M£L±1)  +  (m-k)  k2  . 

But  this  implies,  upon  rearranging  and  using  the  fact  that 
2 

(ro-k-t)  >  m-k- t  , 

that 

Sg (m } k+1 )  >  +  (m-k-l)  (k+l)2  , 

which  suffices  to  complete  the  proof  of  the  theorem  by  induction. 

That  equality  is  possible  in  the  conclusion  of  this  theorem  is 
shown  by  the  m  by  n  bipartite  tournament  whose  score  sequence  is 

V  =  (0,  1,  2 ,  ...  ,  n-1.  n,  ...  ,  n) 

ro-n  terms 

and 

U  =  (1,  2,  ...  ,  n)  . 

The  converse  of  Theorem  2.11.1  is  not  necessarily  valid  for  it 
is  easy  to  construct  bipartite  tournaments  whose  score  sequences 
satisfy  the  inequality  of  the  theorem  but  which  contain  cycles. 

2. 12  On  the  distribution  of  4-cycles  in  random  bipartite  tournaments 
When  an  ordinary  tournament  on  n  points  is  interpreted  as 
representing  an  attempt  to  rank  n  distinct  elements  according  to  some 
criterion  the  presence  of  cycles  indicates  a  degree  of  inconsistency  or 
lack  of  transitivity  in  the  ranking  process.  Significance  tests  for 
the  degree  of  consistency  present  in  an  attempted  ranking  of  n  elements 
by  the  method  of  paired  comparisons  have  been  developed  in  which  the 


Si  ... 

■  - 
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number  of  cycles  of  a  given  type  in  the  corresponding  tournament  is 
compared  with  the  number  which  would  be  expected,  were  all  the 
comparisons  made  at  random.  The  investigation  of  the  complete 
distribution  of  cycles  in  general  appears  to  be  difficult.  Kendall 
and  Babington-Smith  [5^-],  p.  3^7  >  express  it  as  follows!  "In 
discussing  inconsistences,  therefore,  it  seems  best  to  confine  attention 
to  circular  triads  which,  so  to  speak,  constitute  the  inconsistent 
elements  of  the  configuration."  A  circular  triad  is  the  same  as  a 
cycle  of  length  three,  or  a  3_cycle>  in  our  terminology.  They 
obtained  an  expression  for  the  number  of  3_cycles  a  tournament  contains 
in  terms  of  its  score  sequence  and  determined  the  maximum  possible 
number  of  3_cycles  in  any  tournament.  An  independent  derivation  of 
the  number  of  3_cycles  in  a  tournament  in  terms  of  the  scores  by 
Moser  [67]  leads  to  the  maximum  number  of  3“°ycles  possible  in  a 
somewhat  more  straightforward  manner.  Moran  [66]  showed  that  the 
distribution  of  the  number  of  3_cycies  in  a  tournament,  under  the 
hypothesis  of  randomness,  tends  to  normality  as  the  number  of  points 
tends  to  infinity  and  obtained  the  first  four  moments  of  the  distribution. 

Let  C(m,n)  denote  the  number  of  cycles  of  length  four  in  an 
m  by  n  tournament.  Attention  is  concentrated  on  cycles  of  length 
four  since  they  are  the  simplest  cycles  to  study  and  because  if  a 
bipartite  tournament  has  any  cycles  at  all  then  it  has  some  of  length 
four,  as  is  not  difficult  to  establish.  C(m,n)  provides,  in  a  sense, 
a  measure  of  the  degree  of  transitivity,  or  rather  the  lack  of  it,  of 
the  relationships  indicated  by  the  orientations  of  the  edges,  when  a 
bipartite  tournament  is  thought  of  as  representing  the  outcome  of 
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comparing  each  member  of  one  population  with  each  member  of  a  second 
population  and  making  a  decision,  upon  some  basis,  as  to  which  component 
of  each  pair  is  the  preferred  one. 

The  impossibility  of  obtaining  an  expression  for  C(m,n)  as  a 
function  of  the  score  sequence  alone  is  demonstrated  by  the  following 
adjacency  matrices,  and  ,  which  correspond  to  two  bipartite 

tournaments  having  the  same  score  sequence  but  it  can  be  seen  that 
they  do  not  contain  the  same  number  of  4-cycles. 


1 

1 

0 

0 

1 

1 

0 

0 

1 

1 

0 

0 

,  and  A  = 

0 

1 

1 

0 

0 

0 

1 

1 

d 

0 

0 

1 

1 

0 

0 

1 

1 

1 

0 

0 

1 

In  this  section  we  will  obtain  sharp  upper  bounds  for  C(ra,n) 
and  prove  that  under  mild  restrictions  on  the  relative  rates  of  growth 
of  m  and  n  the  distribution  of  C(ro,n)  tends  to  normality  as  m 
and  n  tend  to  infinity. 

For  an  arbitrary  m  by  n  tournament  let  v^  denote,  as  usual, 
the  score  of  P^  ,  for  i  =  1,  . ..,  m.  Then 

m 

(2.12.1)  V  vi(n  -  v.) 
i=l 

equals  the  number  of  ordered  pairs  of  distinct  points,  (Q^,  Q^)  ,  for 
which  there  exists  a  point,  P^  ,  such  that  P_^  — >  ,  counting 


multiplicities 


, 
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Number  the  ordered  pairs  of  distinct  points,  (Q^,  Q^)  ,  from 

1  to  n(n-l)  in  such  a  way  that  if  the  ordered  pair  (Q^,  Q^) 

corresponds  to  s  ,  where  s  =  1,  2,  . ..,  n(n-l)  ,  then  the  ordered 

pair  (Q.,  Q  )  corresponds  to  n(n-l)  -  (s-l).  Let  t  denote  the 
Xj  k  s 

number  of  times  the  ordered  pair  corresponding  to  s  is  counted  in 

(2.12.1) .  Then  it  follows  that 

<!> 

(2.12.2)  C(m,n)  =  ^  Cs  ^(n-lMs-l)  * 

S=1 

Assume  temporarily  that  m  =  n  =  0(2).  We  then  observe  that 
n(n-l ) 

(2.12.5)  )  tg 

S  =  1 


(“) 


Z_<  +  t'n(n-l)-(s-l)  ^ 


s=l 


m 


=  ^  vi^n_vi^  -  m  r2a  • 


i=l 


It  is  now  easily  seen  that  an  upper  bound  for  C(m,n)  in  (2.12.2) 
results  by  first  having 


(2.12.4) 

2 


£  11 
for  T- 

4 

setting 


ts  +  tn(n-l)-(s-l)  111 

values  of  s  and  zero  for  the  remaining  values  and  then 

t  =  t  /  ,\  /  This  implies 

s  n(n-l)-(s-l)  r 


Theorem  2. 12. 1. 


2  2 

C(ro,n)  if  5  n  =  0(2)  . 


m 


m2-l 


If  ro  is  odd  then  is  replaced  by  — ,  and  similarly  for 
n  ,  as  may  be  seen  by  making  the  appropriate  changes  in  (2.12.3)  and 
(2.12.4). 
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That  the  upper  bound  is  sharp  for  the  case  m  =  n  s  0(2)  is 
demonstrated  by  the  bipartite  tournament  in  which  P.  — i ►  Q  if,  and 

X  K. 

only  if,  either  i  <  m/2  and  k  <  n/2  or  i  >  m/2  and  k  >  n/2  , 
otherwise  Q  ->P..  Equally  as  simple  examples  suffice  to  show  that 

cC  X 

the  bound  is  sharp  for  the  other  cases  also. 

In  an  m  by  n  tournament  define  a  random  variable,  S(ij;  ki)  , 
i, j  =  1,  2,  ...,  m,  i  /  j  ,  and  k, i  =  1,  2,  n,  k  /  i  ,  to  be 

1  or  0  according  as  to  whether  th« points  P^  ,  P  ,  ,  and  do 

or  do  not  form  a  4-cycle,  respectively.  Since  only  2  of  the  16 
equally  likely  ways  of  orienting  the  edges  between  these  points  yield 
a  4-cycle  it  follows  that  E[S(ijJ  ki)]  =  1/8.  Summing  over  all 
suitable  pairs,  (i,  j)  and  (k, i)  ,  implies  that 

(2.12.5)  E[C(m,n)]  =  g  (”)  (")  . 


Similarly, 

e[c2( 

=  E[( 

)  s(ij;  w)f]  , 

where  the  Siam 

is  again  over  the 

pairs 

(i>  j 

)  and 

(k,i).  In  the  expansion  of  this 

the  various  types 

of  such  products  ; 

of  products 

are  found  to 

involved  and  the  number 

be  as  follows! 

and  expected  value 

s(ij; 

ki) 

s(ij; 

ki) 

QQ 

1/6 

s(ij; 

ki) 

s(ij; 

ks  ) 

2n  ("H**-1) 

1/32 

s(ij; 

ki) 

s(ih; 

ki) 

Mm g1)^) 

1/32 

s(u; 

ki) 

s(ij; 

rs) 

(p(^)(n22) 

1/64 

(2.12.6)  s(ij; 

ki) 

s(gh; 

ki) 

(pr^K") 

1/64 

s(ij; 

ki) 

s(ih; 

ks  ) 

wr-'x"-1) 

1/64 

s(ij; 

ki) 

s(ih; 

rs) 

a^r'hfg)^2) 

1/64 

-  "  i  •  .  4  | ;  j  -j 

- 

... 

• 

. 
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S(ij;  kt)  s(gh;  ks)  2n(”)(”’‘2)(ng1)  1/64 

S(ij;  kf)  s(gh;  rs)  (2)C”g2)(g)(n22)  1/64  . 

Combining  these  we  have,  where  denotes  the  kth  moment  of 

C(m,n)  about  its  mean,  that 

(2.12.7)  U2  =  E[C2(m,n)]  -  E2[C(m,n)] 

=  \  (”)  (")  (2~.+  2n  -  1)  . 


To  show  that  the  distribution  of  C(m, n)  tends  to  normality  it 
suffices  (Kendall  and  Stuart  [p6],  P.  Hi)  to  show  that,  for  h  =  1,  2,  ..., 


(i) 


^2h+l 

u  t(2h+l) 

h-2 


0 


and  (ii) 


(2h): 

2h  hi 


> 


as  ro  and  n  tend  to  infinity. 


We  shall  temporarily  assume  that  n  =  o(m)  as  m  and  n  tend 
to  infinity. 

If  T(ij;  kl)  =  S(ij;  k l)  -  1/8  ,  then  ^2h+1  = 

E[(  \  l(ijj  kf))2^+^]  ,  for  h  =  1,  2,  ...  ,  where  the  sum  is  again 
over  the  pairs  (i,  j)  and  (k, i).  A  typical  term  in  the  expansion 
of  this,  for  a  fixed  value  of  h  ,  is  T(i^j^J  k^i^)  ••• 

^^2h+l  ^2h+P  ^2h+l  ^2h+l^*  Put  t*ie  t^1^s  term,  which  have 

any  values  of  i,  j,  k,  or  i  in  common  with  those  of  the  first  factor 

in  a  class  with  it.  Add  to  this  class  any  T's,  still  in  this  same 
term,  which  have  any  values  of  i,  j,  k,  or  &  in  common  with  those 
of  any  of  the  T's  already  in  this  class  and  continue  this  process  as 
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long  as  is  possible.  In  a  similar  fashion  form  another  class  starting 
with,  say,  the  first  factor  not  already  included  in  the  first  class. 

By  repeating  this  process  any  term  may  be  expressed  as  the  product 
of  classes  of  products  such  that  no  two  factors  in  different  classes 
have  any  values  of  i,  j,  k,  or  i  in  common,  while  for  any  factor  in 
a  class  containing  more  than  one  factor  there  is  another  factor  in  the 
same  class  with  which  it  does  have  a  value  of  i,  j,  k,  or  S,  in  common. 

Combining  all  terms  which  have  similar  combinations  of  (i,j) 
and  (k,  i)  occurring,  as  was  done  in  (2.12.6)  for  the  second  moment, 
the  number  of  times  terms  of  a  given  type  appear  will  be  a  polynomial 
in  m  and  n  whose  largest  term  is  of  order  equal  to  the  number  of 
distinct  values  of  i  and  j  ,  and  k  and  "  Jl  ,  respectively,  that 
appear  in  the  term. 

If  any  class  in  a  terra  contains  just  one  factor  the  expected  value 
of  that  term  will  equal  the  expected  value  of  the  product  of  the 
remaining  factors  times  the  expected  value  of  the  single  factor,  or 
zero.  Also,  any  term  with  more  than  h  classes  has  expected  value 
zero  since  it  is  forced  to  have  at  least  one  class  containing  a  single 
factor. 


Restricting  ourselves  now  to  terms  all  of  whose  classes  contain 
at  least  two  factors  we  may  make  the  following  assertion!  If  such  a 
term  is  to  have  a  non- zero  expectation,  then  each  factor  in  a  class  may 
include  at  most  one  value  of  i,  j,  k,  or  i  not  included  in  any  other 
factor  in  the  class.  For,  if  any  factor  has  fewer  than  three  values  of 
i,  j,  k,  and  i  in  common  with  those  of  the  remaining  factors  in  the 
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class,  then  its  expectation  is  independent  of  the  expectations  of  the 
remaining  factors,  as  was  seen  in  (2.12.6).  As  its  expectation  is  zero 
the  expectation  of  the  whole  term  is  consequently  zero. 


Therefore,  for  any  class  of  factors  in  a  term  the  largest 
number  of  distinct  values  of  i,  j,  k,  and  i  that  may  occur  in  it 
is  four  for  the  first  factor  and  one  new  value  for  each  of  the  remaining 
factors.  Hence,  the  term  containing  the  largest  number  of  different 
values  of  i,  j,  k,  and  l  with  a  non-zero  expectation  will  have  h-1 
classes  with  two  factors  each  and  one  class  of  three  factors.  From  the 
hypothesis  on  the  relative  orders  of  ro  and  n  we  see  that  the  largest 
term  in  l^h+l  *'S  contributed  by  products  of  the  form 


h-1 


n  MsW.)  T^8sjs^ksis^  ^hWh^  T(8hWh)  T(fhWh^ 


s=l 


where  different  letters  represent  different  numbers. 


Therefore,  is  a  polynomial  whose  largest  term  is  of  degree 

3h+l  in  m  and  2h  in  n.  Hence,  for  fixed  h  and  m  and  n  tending 
to  infinity  in  the  prescribed  manner,  it  follows  that 


(2.12.6) 


^2h+l 

u  t(2h+l) 

^2 


0 


3h+l  2h  \ 
nr  n  \ 

3/2(2h+l)  n2h+l  J 


In  considering  p.^  the  same  type  of  argument  may  be  used  to 

assert  that  its  highest  ordered  terms  arise  from  products  of  the  following 
h 

type:  II  T<VS;  Vs)  T(gsV  Vs>  • 

S  =  1 


. 
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The  number  of  times  terms  of  this  type  occur  is 


(2h):(“)r'h)(g) 


^m-Jh+2  ^n-2h+2  ^ 


3h  2h 
m  n 


(2h)I 


22h  h' 


retaining  only  terms  of  highest  order. 


The  expectation  of  such  a  term  equals  {E  [t(  i  j*ki)T(g y,  ki)  ]  =  1/2^  , 


as  may  be  seen  by  direct  considerations.  Hence,  the  leading  term  of  p, 


2h 


— — 75^ - 1».  and  that  of  p  is  nr'n^/2^ .  Therefore,  for  fixed 


15 


h  and  ro  and  n  tending  to  infinity  in  the  prescribed  manner,  it 
follows  that 


(2.12.9) 


2h 


P, 


(2h): 

2h  hi 


which  completes  the  verification  of  (i)  and  (ii)  for  this  case. 


For  each  fixed  value  of  h  the  only  essential  difference  in 
the  arguments  when  n  — > cm  ,  0  <  c  <  1  ,  as  m  and  n  tend  to  infinity, 

will  be  the  appearance  of  the  additional  factor  (l+c)  in  both  p0^ 
and  pg  which  clearly  leaves  (2.12.9)  unchanged. 

Under  the  given  conditions  this  completes  the  proof  of 
Theorem  2.12.2. 


f  C(m,n)  -  |  (™)  Q) 

Pr  ,  a  <  - - - — - — — r  <  b 

i[(p(^)(2m+2n-l)P 


dy  . 
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2. 15  The  expected  number  of  cycles  in  a  random  bipartite  tournament 
In  spite  of  the  fact  that  the  exact  distribution  of  the  number 
of  cycles  of  a  given  length  in  a  random  tournament  does  not  appear  to 
be  known  in  general  it  is  not  difficult  to  calculate  the  expected 
number  of  cycles  of  a  given  length.  In  this  section  we  treat  the 
corresponding  problem  for  bipartite  tournaments. 

Let  C^,(m,n)  ,  £  =  2,  . . . ,  n  ,  denote  the  number  of  cycles  of 

length  2£  in  an  m  by  n  tournament,  where  m  >  n  >  2.  Under  the 
hypothesis  of  randomness  it  follows  that  the  probability  that  any  set 
of  i  distinct  P  points  and  H  distinct  Q  points  will  form  a 
cycle  of  length  21  is 

2i2_2i  /  =  i[U-l):]2  /  22i  , 

2 

since  the  points  on  the  cycle  can  be  ordered  in  i[(H-l)']  ways  and 
there  remain  t- 2&  edges  each  of  whose  orientation  may  be  chosen  in 
one  of  two  ways.  Summing  over  all  such  sets  of  i  distinct  P  points 
and  i  distinct  Q  points  gives 

(2.13.1)  E[Ci  (m,n)]  =  (”)  (")  l ((i-l)J  f/221  . 

For  i  =  2  this  gives  (2.12.5). 

A  complete  (oriented)  cycle  is  one  which  includes  each  point  of 
the  graph  once  and  only  once.  A  bipartite  tournament  cannot  contain  a 
complete  cycle  unless  m  =  n  in  which  case 

2 

(2.13.2)  E[Cm  (m.ra)]  =  ~  £*  (~fw  , 

ra  2 


from  Stirling's  formula 
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Moser  [67]  has  pointed  out  that  there  seem  to  be  no  known 
examples  of  ordinary  tournaments  on  n  points  which  in  general  contain 
as  many  complete  cycles  as  does  the  average  tournament  on  n  points. 

For  bipartite  tournaments  this  presents  no  difficulty,  however.  If  ro 
is  even  consider  the  m  by  m  bipartite  tournament  in  which  -►  Q 
if,  and  only  if,  both  i  and  j  are  less  than  or  equal  to  m/2  or 
both  i  and  j  are  greater  than  m/2.  Otherwise  — » P^.  It  is 

easily  seen  that  the  number  of  complete  cycles  contained  in  this  graph  is 

(2.13.3)  “  [(§)J]^  ~  2  t?  m  (g")21*  ~  it  m  E[Cm(m,m)]  . 

Slight  modifications  yield  asymptotically  the  same  number  of 
complete  cycles  when  m  is  odd. 

A  possible  conjecture  would  be  that  the  left  member  of  (2.13.3) 
is  the  maximum  possible  number  of  complete  cycles  in  an  m  by  m 
tournament  when  m  is  even.  The  likelihood  of  this  is  increased  by 
the  fact  that  it  was  essentially  this  same  bipartite  tournament  which 
contained  the  maximum  possible  number  of  4-cycles  in  connection  with 
Theorem  2. 12. 1. 

The  expected  number  of  paths  of  a  given  length  in  an  m  by  n 
tournament  can  be  obtained  in  the  same  way  as  was  the  expected  number 
of  cycles  of  a  given  length. 

Redei  [80]  has  shown  that  every  ordinary  tournament  on  n  points 
contains  a  path  of  length  n-1  and,  as  mentioned  in  §2.5>  an  ordinary 
tournament  contains  a  complete  cycle  if,  and  only  if,  it  is  irreducible 
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from  results  of  Camion  [6]  and  Roy  [86],  Foulkes  proof  of  this  latter 
statement  in  [27]  is  incomplete  as  the  tacit  assumption  is  made  that  a 
tournament  obtained  from  an  irreducible  tournament  by  the  removal  of 
any  one  point  and  the  edges  incident  upon  it  is  always  irreducible 
itself,  the  falsity  of  which  may  be  demonstrated  by  examples.  That 
neither  the  theorems  of  Redei  nor  Caroion-Roy  hold  for  m  by  ro  bipartite 
tournaments  in  general  may  be  shown  by  examples  although  the  conditions 
in  the  latter  theorem  are  clearly  necessary.  Another  condition  which 
is  obviously  necessary  for  the  existence  of  a  complete  cycle  in  an 
m  by  m  tournament  is  that  no  more  than  j  points  in  one  point  set, 
each  of  whose  score  is  j  ,  have  all  the  j  edges  which  are  oriented 
away  from  each  of  them  oriented  towards  the  same  j  points  in  the 
other  point  set,  for  j  =  1,  2,  ...,  m  .  A  conjecture  is  that  this 
condition  along  with  irreducibility  is  sufficient  to  insure  the 
existence  of  a  complete  cycle.  A  slight  modification  gives  a  necessary 
condition  for  the  existence  of  a  path  of  length  2m-l  in  an  m  by  ro 
tournament. 

If  one  removes  the  restriction  that  no  point  is  to  be  included 
more  than  once  in  a  cycle  or  path  then  it  follows  from  the  definition 
of  matrix  multiplication  that  the  total  number  of  cycles  of  length  2i  , 
i  =  2,  *,  . • .  }  an  m  by  n  tournament  is  equal  to 

\  tr  [A-C]^  =  j  tr  [C*  A] ^  , 

where  A  and  C  are  the  adjacency  matrices  associated  with  the 

i 

tournament  as  defined  in  §2.9.  The  diagonal  elements  of  [A»C]  and 
[C«A]  are  the  number  of  paths  of  length  2Z  beginning  and  ending 


with  a  given  point.  The  trace  is  the  sum  of  these  numbers  and  it  is 
necessary  to  divide  by  £  since  each  such  cycle  is  counted  £  times. 
Various  other  configurations  may  be  counted  in  a  similar  fashion.  For 
corresponding  considerations  for  ordinary  directed  graphs  see  e.g. 
Kemeny,  Snell,  and  Thompson  [53]  >  chapter  or  [11],  unit  IV  -  27. 

The  problem  of  determining  the  number  of  cycles  and  paths  of  a 
given  length,  in  the  sense  of  our  original  definition,  from  the  number 
of  cycles  and  paths  where  some  points  may  be  involved  more  than  once 
appears  to  be  quite  complicated.  For  ordinary  directed  graphs  this  is 
treated  in  Harary  and  Ross,  [34]  and  [37].  These  papers  contain 
references  to  other  relevant  papers. 

2. 14  On  the  numb er  of  isolates  in  a  bipartite  tournament 

By  an  isolate,  in  a  bipartite  tournament,  is  meant  a  point 
whose  outdegree  is  zero.  It  is  impossible  for  there  to  be  isolates 
among  both  the  P  and  Q  points  of  a  bipartite  tournament 
simultaneously.  In  this  section  the  expected  value  and  variance  of 
the  number  of  isolates  is  derived  under  the  hypothesis  of  randomness 
described  earlier. 

An  analogous  problem  for  ordinary  directed  graphs  where  the 
points  are  interpreted  as  representing  people  and  the  edges  represent 
choices  for  friends  has  been  treated  by  Katz  [49].  (See  also  Katz 
and  Powell  [52].) 

In  an  m  by  n  tournament  in  which  the  outdegrees  of  the  points 
P^  and  Q^.  are  denoted  by  v_^  and  u_.  ,  respectively,  let  Ip(m,n*  t 
be  the  number  of  isolates  among  the  points  of  P  ,  where 


-  .  (_ 


-  '  .  ...  :  .  . 

- 

'  - 

"  - .  . 

:  ] 


.c 


;•  - 
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m 


y>  ^  v^  =  t.  If  denotes  the  event  that  ,  i  =  1,  m  ,  is 


i=l 

an  isolate,  then  (see  e.g.  Feller  [26],  p.  96) 

m-k 

Pr  [lp(m,n*  t)  =  k]  =  ^  (-l)^  (l^J)  Sk+j  ,  for  k  =  0,  1,  . . . ,  id  , 

where 


Wj  =  I Pr  uy  v  ’  \+j]  ’ 


the  latter  sum  being  over  all  distinct  combinations  of  k+j  of  the 
events  A^. 


But 


_  ,  m  v  rn(m-k-j)x  ,  ,m  nx 

bk+j  'k+j'  ^  t  )  /  \  t  )  > 


as  there  are  (k^\)  ways  of  selecting  the  k+j  points  of  P  which  are 


to 


be  isolates  and  (n(m“k  j))  wayS  0f  choosing  t  of  the  n(m-k-j) 


edges  not  incident  upon  the  isolates  to  be  oriented  towards  a  Q  point. 
(in^n)  the  total  number  of  m  by  n  tournaments  containing  t 

edges  oriented  towards  a  point  of  Q. 

Therefore, 

m-k 

(2.14.1)  (”tn)  Pr(lp(m,n;  t)  =  k)  =  ^  (-1)J  (k+J )(k”. )(n(,”‘k‘ i ) ) 

j=0 


m-k 


=  (”)  ^  (-DJ  C”-k)(n("’;k-J)) 


j=0 


In  particular, 


J  1  hi  . 
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(2.14.2)  rcn)  Pr(lp(»,n;t)  =  0)  =  ^  (-DJ  (")  (n("'J)) 

j=0 

r 

0  ,  if  t  <  id, 

j 


The  sum  is  over  all  sets  of  nonnegative  integers,  (a^,  a^,  a^)  , 

such  that 


and 


1*  a.  +  2*  a0  +  . . .  +  n*  a  =  t  , 
1  2  n  7 


al  +  a2  +  •  “  ’  +  an  =  m  * 


The  proof  of  the  last  part  is  immediate  upon  observing  that  if  there 
are  a_^  P  points  of  outdegree  i  ,  for  i  =  1,  2,  . ..,  n  ,  the  number 
of  ways  of  selecting  these  is  the  multinomial  coefficient  in  the 
expression  and  for  each  point  of  outdegree  i  there  are  (^)  ways  of 
selecting  i  of  the  Q  points  such  that  the  edges  joining  the  P 
point  to  these  points  are  oriented  towards  the  Q  points.  The  last 
expression  includes  the  first  two  as  special  cases.  The  identity 
resulting  in  these  cases  is  contained  in  Netto  [70],  p.  2 p6,  but  the 
proof  there  is  by  equating  coefficients  of  similar  terms  in  an  algebraic 
identity. 


For  example,  when 


m  =  J,  n  =  4  ,  and  t  =  (2.14.2) 


becomes  (g)^3)  -  -  624  =  +  (2^)(J)2(^) 


If  m  =  n  ,  then 


mn 


- 


f 


L 


, 
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m 


(2.14.5)  Pr(l  (n.,n;t)  =  0)  =  )  (-l)JC")  W^-j)](t) 

^  J  (mS)- 


>0 


(t) 


m 


<  ^  <-DJ  (pd-j/”)' 

j=0 


'v  e 


-a 


if  A  =  ni  e  m  is  bounded  as  m  and  t  tend  to  infinity.  (See 


Feller  [26],  p.  94.) 


This  problero  can  be  considered  as  being  an  extension  of  the 

« 

classical  occupancy  problem  treated  in  the  last  cited  reference. 

The  determination,  in  this  case,  of  the  expected  total  number 
of  isolates  may  be  obtained  from  what  has  been  shown  and  by  appealing 
to  symmetry. 


When  t  is  not  specified  let  Ip(m,n)  denote  the  number  of  P 
points  in  a  given  m  by  n  tournament  which  are  isolates. ,  The 

probability  that  any  particular  point  of  P  is  an  isolate  is  clearly 

l/2n.  Hence,  Ip(m,n)  has  a  binomial  distribution  and 

(2.14.4)  E[lp(m,n)]  =  m/2U  , 

and 

(2.14.5)  cr2  [Ip(m,n)]  =  m(2B-l)/22n  . 


This  indicates  the  relationship  between  the  number  of  isolates  in 
a  bipartite  tournament  and  the  relative  magnitudes  of  m  and  n. 


of  an 


if  iQ( 

ro  by  n 


m,n)  denotes  the  number  of  isolates  among  the 
tournament  then  l(m,n)  =  Ip(ro,n)  +  I,  (m,n) 


Q  points 
is  the  total 


number  of  isolates  in  a  bipartite  tournament.  Then 
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(2.1 4.6)  E[l(m,n)]  =  m/2r  +  n/2™  , 
and 

(2.14.7)  cr2[l(m,n)]  =  n,(2n-l)/22u  +  n(2n'-i)/22ro  -  2ron/2m+n  , 

from  (2.14.4),  (2.14.5)  symmetry,  and  the  fact  that  E [l0(m,n)»I^ (m,n) ]  =  0. 
(See  e.g.  Feller  [26],  pp.  215-216.) 

From  Chebyshev's  inequality  it  follows  that  if  m  and  n 
satisfy  (2.7.1)  while  tending  to  infinity  it  becomes  increasingly 
unlikely  that  a  random  bipartite  tournament  has  very  many  isolates. 

The  identical  distributions  apply  for  the  number  of  P  and  Q 
points  in  a  random  id  by  n  tournament  of  outdegrees  n  and  m  , 
respectively. 


2.15  On  the  largest  score  in  a  bipartite  tournament 

The  probability  that  a  given  P  point  in  a  random  m  by  n 

tournament  has  score  Jt,  is  clearly  (^)/2n  ,  for  Jt  =  0,  1,  . ..,  n  . 

Hence,  the  probability  that  the  score  of  such  a  point  is  not  more  than 

1  is  1  0/2"  • 
k=0 

Let  Kp(ra,n)  denote  the  maximum  of  the  scores  of  the  P  points 
in  a  random  m  by  n  tournament.  Then 


(2.15.1) 


Pr[Mp(m,n) 


k=0 


12 


m 

|  •  l/a™1 


) 


) 


for  Jt,  =  0,  1,  . . . , 


n 


the  probability  that  no  score  of  a  point  in  P 


] 


[ 
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is  greater  than  i  minus  the  probability  that  all  the  scores  of  the 
points  in  P  are  less  than  £.  Therefore, 

n 

(2.15.2)  EfMp^n)]  =  ^  i  Pr  [M^n)  =  i) 

£=0 


1=1  k=0 


The  inner  sum  in  this  expression  can  be  written  in  terms  of  an 
incomplete  beta  function,  tables  of  values  of  which  are  available,  e.g. 
Pearson  [72]. 

Even  more  difficult  would  be  the  obtaining  of  the  expected  value 
of  the  largest  of  all  the  scores  in  a  random  bipartite  graph,  not  just 
restricted  to  the  scores  of  the  points  in  one  subset  of  points.  A 
solution  to  the  corresponding  problem  for  ordinary  tournaments  does 
not  appear  to  have  been  published,  except  for  some  complicated 
recurrence  formulae  by  David  [12], 

Let  m  =  n.  Following  Gumbel  [33];  P.  82,  the  characteristic 
largest  score  among  the  P  points  would  be  that  number,  k  ,  such  that 
the  expected  number  of  P  points  whose  score  is  greater  than  or  equal 
to  k  is  one.  By  definition  it  would  be  the  number,  k  ,  such  that 

k-1 

^  (")  •  l/2n  =  1  -  1/n  . 

£=0 

Using  the  normal  approximation  to  the  binomial  distribution 
(cf.  Feller  [26],  chapter  7)  one  finds,  as  a  first  approximation,  that 


.  . ,  J 
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+  /§  log  n 
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n 

2 


2. 1 6  On  the  number  of  locally  maximal  and  locally  minimal  points  in  a 

bipartite  tournament 

In  an  m  by  n  tournament  a  point  ,  whose  score  is  greater 

than  zero,  will  be  said,  for  lack  of  a  better  term,  to  be  locally 
maximal  if,  and  only  if,  its  score  is  strictly  greater  than  the  score 
of  every  point  Q^.  such  that  P_^  — *Qj.  Replacing  the  final  "greater 
than"  by  "less  than"  defines  a  locally  minimal  P  point.  Locally 
maximal  and  locally  minimal  points  of  Q  are  defined  in  an  analogous 
fashion.  As  defined  here  the  concept  of  a  locally  maximal  point  is 
not  the  dual  of  the  concept  of  a  locally  minimal  point. 

In  this  section  we  determine  the  expected  number  of  locally 
maximal  and  minimal  points  in  a  random  bipartite  tournament,  derive 
sharp  upper  bounds  for  the  number  of  locally  maximal  points  possible  in 
a  graph,  and  count  the  number  of  nonisomorphic  bipartite  tournaments 
in  which  this  number  is  attained. 

Let  A(m,n*  P^)  be  a  variable  which  assumes  the  value  one  or 
zero  according  as  to  whether  P^  is  or  is  not,  respectively,  a  locally 
maximal  point  in  a  random  m  by  n  tournament.  Let  B(m, n*  P  )  be 
defined  similarly  according  as  to  whether  P_^  is  a  locally  minimal 
point.  The  same  variables  may  be  defined  for  points  of  Q.  Let  v^ 
denote,  as  usual,  the  score  of  P^  ,  i  =  1,  2,  ...,  m  .  The  same  type 
of  reasoning  as  was  used  in  §§2.l4  and  2.1p  implies  that 
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and 


Pr{A(n,,n;P.)  =  l|v.  =  k] 


m-1 


£=k 


Pr{B(ni,n;Pi) 


for  k  =  1,  2,  . ..,  n  ,  and  the  left  members  of  both  equations  equal 
zero,  by  definition,  when  k  =  0. 


Summing  over  k  gives  the  result  that 

n 

(2.16.1)  Pr{A(m,n;P.)  =  l]  =  ^ 

k=l 

and 

n 

Pr[B(iD,n;P.)  -  1}  -  ^  (“)•  l/2n 

k=l 

To  obtain  corresponding  results  for 
interchange  m  and  n.  It  follows  that  the  expected  number  of  locally 
maximal  and  locally  minimal  points  in  a  random  m  by  n  tournament  is 

m  Pr{A(m,n;P_^)  =  l}  +  n  Pr{A(m,n*Q^)  =  l]  , 

and 

m  Pr{B(m,n;P_^)  =  l]  +  n  Pr{B(m,n^Q^)  =  l)  ,  respectively. 

For  m  =3  and  n  =  2  these  values  are  139/64  and  24/64, 
respectively.  These  numbers  are  in  agreement  with  an  actual  count  of  the 
things  involved  over  the  set  of  sixty-four  3  by  2  tournaments. 


m-1 


r  -|k 

1  -  /  •  Va™'1 


J=k 


1  "  L  *  V2™”1 


£=0 


it  is  necessary  only  to 


: 


. 
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Since  some  point  must  have  a  score  smaller  than  the  score  of  any 
other  point  it  follows  that  not  all  of  the  points  in  an  m  by  n 
tournament  can  be  locally  maximal  simultaneously. 

If  m,n  >  1  consider  the  bipartite  tournament  in  which 


P.  -*  Q . 
i  J 

and  Q  — >  P^  , 

for 

i  = 

2, 

...,  m  ,  and  j  =  1,  ...,  n. 

If  n  = 

1  let  P.  ->  Q1 

l  1 

for 

all 

i. 

These  tournaments  contain  m+n-1 

locally  maximal  elements.  This  implies  the  following  result! 

Theorem  2.16.1.  The  maximum  number  of  locally  maximal  points 

possible  in  a  nontrivial  m  by  n  bipartite  tournament  is  m+n-1. 

We  now  proceed  to  determine  the  number  of  nonisomorphic  m  by  n 
bipartite  tournaments  containing  m+n-1  locally  maximal  points,  where 
m,n  >  2. 


First  we  observe  that  in  such  a  graph  there  is  precisely  one 
point  whose  score  is  less  than  the  scores  of  the  remaining  points,  since 
there  is  at  least  one  such  point  and  if  there  were  more  than  one  there 
would  be  fewer  than  m+n-1  locally  maximal  points.  Assume  temporarily 
that  this  point  is  P^.  Next  we  observe  that  among  the  remaining  points 
it  is  impossible  for  a  P  point  and  a  Q  point  to  have  the  same  score 
and  both  be  locally  maximal  points.  Hence,  those  points  with  the  next 
to  the  smallest  score  must,  in  this  case,  all  be  Q  points  and, 
furthermore,  they  must  all  have  score  one  to  be  locally  maximal  since 
P-^  is  the  only  point  having  a  lower  score.  This  further  implies  that 
P^  must  have  score  zero.  Next,  those  points  with  the  third  lowest 
score  in  this  case,  if  there  are  any  such  points,  must  all  be  P  points 


"  -  „  . 
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and  each  of  their  scores  roust  equal  the  number  of  Q  points  which  had 
the  second  lowest  scores.  In  order  for  these  P  points  to  be  locally 
maximal  there  must  be  at  least  two  such  Q  points  with  score  one. 

The  same  type  of  considerations  repeated  as  often  as  is  necessary 
suffice  to  establish  the  validity  of  the  following  assertion:  In 

every  nontrivial  m  by  n  bipartite  tournament  containing  m+n-1 
locally  maximal  points,  in  which  is  the  point  with  minimal  score, 

the  points  of  P  and  Q  may  be  separated  into  k  and  £  mutually 
exclusive  and  exhaustive  non-vacuous  classes,  where  the  ith  and  jth 
class  contain  r_^  and  s  points,  respectively. 

X  A 

(2.16.2)  k-£  =  1  or  0  ,  ^  ri=m,  ^  s.=n, 

i=l  j=l 

and  such  that  the  edges  joining  the  points  in  the  ith  class  of  P 
points  to  the  points  in  the  first  i-1  classes  of  Q  points, 
i  =  2,  ...,  k  ,  are  oriented  towards  the  Q  points  and  all  the 
remaining  edges  are  oriented  towards  P  points.  In  order  for  all  the 
points  with  the  exception  of  P^  to  be  locally  maximal  it  roust  also 
be  the  case  that 


<  Sl+S2 

<  r1+rg+r. 

<  Sj+S 

2+sj 

<  ... 

rl  +  r2 

+  .  .  .  + 

;  if 

k  = 

1 ; 

S1  +  S2 

+  .  .  .  + 

Si 

>  if 

k  = 

u 1  . 

Furthermore,  each  such  tournament  determines  uniquely  the  r^'s 
and  sj*s  >  in  or(3er,  and  the  converse  as  well  is  true  up  to  an 
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isomorphism.  Therefore,  the  number  of  nonisomorphic  m  by  n  tournaments 
containing  ro+n-1  locally  maximal  points  such  that  is  the  point 

with  the  smallest  score  is  the  number  of  ordered  sets  of  strictly 
positive  integers  satisfying  (2.16.2)  and  (2.16.5). 

If  k  =  l  =  r  ,  let 

h_  Jh 

J2h  =  Si  and  J2h-l  “  1  rj  '  h  *  r  • 

1=1  j=l 

Then  the  conditions  in  (2.16,5)  may  be  rewritten  as 

(2.16.4)  1  =  Ji  <  J2  <  <  J4  <  »..  <  J2r-1  =  “  *  311(1  J2r  =  n  * 

By  definition  it  must  be  that  ^o(r  —  min  [id,  n]  -  1  =  T)  . 

In  this  case  it  is  easily  seen  that  there  is  a  one-to-one 
correspondence  between  these  sets  of  numbers  and  those  satisfying 
(2.16.2)  and  (2.16.5). 

Now  let  ji  =  Jij_1  -  Ji  ,  for  i  =  1,  2,  . ..,  2(r-l)-l  =  2r-5. 
Since  >  1  it  follows  that  (j^,  j^,  . ..,  _.)  forms  a  composition 

of  ^-1  into  2r~5  parts.  (r  is  at  least  two  in  the  case  we 

are  considering. )  Again  there  is  a  one-to-one  correspondence  between 

and  the  sets  of  the  J..  But  for  a 

l 

the  number  of  compositions  of  J^,  „  ..-1 

2(r-l ) 


the  sets  consisting  of  the  j. 


fixed  value  of  r  and  J 


2(r-l) 


into  2r-5  parts  is 


J2(r-1 )"2 
2(r-2) 


as  was  mentioned  in  §2.10 


'  > 


I 

L 


It  is  not  difficult  to  see  that  r 


in  this  case ,  can  vary 


Summing  over  these  cases  counts  the  number  of  m  by  n  tournaments 


satisfying  the  given  conditions  and  containing  m+n-1  locally  maximal 
points.  Only  slight  modifications  are  necessary  to  treat  the  case 
where  k  =  i+l0  Combining  these  gives 


Theorem  2.16.2. 


The  number  of  nonisomorphic  m  by  n  tournaments 


m,n  >  2  ,  containing  only  one  point#,  a  P  point,  which  is  not  a 
locally  maximal  point  is 


(  j’2) 
'2r-3'  ' 


where  3  =  min {[m/2]  ,  [n/2]}. 

The  1  counts  the  tournament  given  as  an  example  in  the  proof 
of  Theorem  2.16.1  and  corresponds  to  the  only  graph  where  r  =  1  for 
these  cases.  When  m  or  n  is  less  than  two  the  problem  is  trivial. 
The  symmetry  of  the  above  expression  in  m  and  n  indicates  that  it 
also  represents  the  number  of  tournaments  in  which  every  point  is  a 
locally  maximal  point  except  for  a  single  Q  point  . 


Corresponding  results  about  the  number  of  locally  minimal  points 


possible  in  a  bipartite  tournament  appear  to  be  more  difficult  to 
obtain.  The  m  by  m  tournament  in  which  P^  — if,  and  only  if, 
i  =  j  ,  for  i,j  =  1,  2,  ...,  id  contains  m  locally  minimal  points 
for  ro  >  3  and  this  can  be  extended,  except  for  a  few  small  values  of 
ro  and  n  ,  to  show  that  it  is  possible  in  general  for  an  m  by  n 


t.  j  ,  ;  |]}  . 


-  ... 


- 
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tournament  to  have  as  many  as  max[m,n]  locally  minimal  points.  A 
conjecture  is  that  this  is  the  maximum  number  it  could  have  although  the 
only  upper  bound  on  the  number  of  locally  minimal  points  possible  in  an 
m  by  n  tournament  that  we  have  been  able  to  establish  is  roax[m,n] 

+  mn/(ro+n).  This  arises  from  the  observation  that  if  is  the  point 

with  lowest  score  in  a  bipartite  tournament  then  no  point,  ,  such 

that  — ►  P_^  can  be  locally  minimal,  and  the  lowest  score  in  an 

m  by  n  tournament  is  certainly  no  more  than  mn/(m+n). 

2.17  A  measure  of  the  degree  of  similarity  between  different  bipartite 

tournaments 

Kendall’s  t  ,  [55] >  provides  a  measure  of  the  degree  of 
correlation  between  two  different  rankings  of  a  set  of  n  objects. 

The  distribution  of  t  under  the  hypothesis  that  all  rankings  are 
equally  likely  has  been  investigated  in  order  to  test  the  significance 
of  various  values  of  r  in  a  given  situation.  In  this  section  we 
extend  some  of  these  considerations  to  where  each  element  in  one  set 
is  compared  with  each  element  in  a  second  set  and  a  choice  is  made  for 
each  such  pair  as  to  which  is  the  preferred  element,  with  respect  to 
some  criteria.  We  shall  use  the  terminology  of  bipartite  tournaments 
as  they  provide  a  model  of  such  situations. 

Let  there  be  given  two  id  by  n  tournaments  both  of  whose  point 
sets  are  labelled  P  =  [P^,  ...,  P^}  and  Q  =  [Q^,  ...,  Qn).  Let  S 
and  N  denote,  respectively,  the  number  of  edges  (P^,  Q^)  which  are 
oriented  in  the  same  sense  and  in  the  opposite  sense  in  the  two  graphs. 


Then  S-N  may  vary  between  +mn  and  -ran  and 


. 


L 


.  :  . 
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(2.17.1)  T  .  -  1-S  .  £  -  ! 

m  n  ran  ran 

varies  between  +1  and  -1  ,  when  there  is  complete  agreement  and 
complete  disagreement  between  the  two  graphs,  respectively,  providing 
what  may  be  considered  as  a  coefficient  of  correlation  between  the  two 
tournaments. 

This  can  be  interpreted  as  being  a  special  case  of  the  general 
correlation  coefficient,  (see  Kendall  P.  IT) 


where  a.  .  is  +1  or  -1  according  as  to  whether  P.  — ►  Q  .  or 
ij  i  J 

-»  P_^  ,  respectively,  in  the  first  tournament  and  b  is  similarly 
defined  with  respect  to  the  second  tournament,  for  i  =  1,  . . . ,  m  , 
and  j  =  1,  . . . ,  n. 

Under  our  hypothesis  of  randomness  the  distribution  of  r  is 
easy  to  obtain,  it  being  simply  a  binomial  distribution  and  a  special 
case  of  a  more  general  distribution  to  be  treated  shortly.  The  central 
limit  theorem  implies  that  the  distribution  of  (mn)^  r  tends  to  the 
normal  distribution  with  zero  mean  and  unit  variance  as  ran  tends  to 
infinity. 

Admitting  the  possibility  of  a  tie  between  two  elements  being 
compared,  which  may  be  indicated  by  the  non-existence  of  an  edge  joining 
the  corresponding  points  in  the  corresponding  tournament,  introduces 
complications.  In  the  first  place  even  the  concept  of  correlation 


V 


T 


. 
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between  two  sets  of  rankings  one  or  both  of  which  contain  ties  admits  of 
various  interpretations.  For  a  discussion  of  this  problem  see  the  third 
chapter  of  the  last  cited  reference  and  also  Burr  [4], 

However,  for  our  purposes,  the  following  is  as  simple  an 
extension  as  any  of  the  definition  of  t  to  include  the  case  where 
ties  are  permitted.  If  and  are  joined  by  an  edge  in  neither 

graph  the  contribution  of  this  pair  to  S  is  one,  i.e.  it  counts  as 
an  agreement.  If  they  are  joined  by  an  edge  in  only  one  of  the  graphs 
this  will  contribute  nothing  to  either  S  or  N.  In  other  cases  the 
earlier  definition  applies. 


Let  the  two  m  by  n  tournaments  being  compared  contain  t  and 


u  ties,  respectively,  where  t,  u  =  0,  1,  ...,  mn.  By  considering 
separately  the  expected  contribution  of  pairs  of  points  which  are 
joined  by  an  edge  in  neither  graph  and  those  which  are  joined  in  both 
graphs  the  expected  value  of  t  under  the  hypothesis  of  randomness  is 
found  to  be 

(2.17.2)  E(t)  =  2[tu/mn  +  ^  (mn-t)(ron-u)//mn]  /  mn  -  1 


~  tu  -  (t+u)ron 
(ron)^ 


In  determining  the  variance  of  r  under  these  conditions  let 
denote  the  contribution  to  S  of  the  pair  of  points,  (P^,  Qj). 
As  before  it  follows  that 


tu 


1  (mn-t ) (rnn-u) 


+ 


-  . 


•  X 


- 


:  ••  • 


T 


\  ‘  -  ■  v  \  1 
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Also, 

^(X.j)  =  EfX.j)  •  [1  -  E(X.  ,)]  , 

since  X. .  has  a  binomial  distribution  for  fixed  values  of  m,  n,  t, 
ij  9  9  * 

and  u. 


Furthermore,  if  not  both  i  =  i  and  j  =  k  , 


t(t-l)  u(u-l)  tu(mn-t)(mn-u 

2  + 

[mn(ron-l)]'<:  [ran(mn-l)] 

JL  (mn- 1 )  (mn- 1- 1 )  (ton-  u  )  (ron-  u- 1 ) 

+  4  r  /  ,  x,2  * 

[mn^mn-l J J 

counting  separately  the  contributions  from  two  pairs  of  points  both 
pairs  of  which  are  tied  in  both  graphs,  two  pairs  of  points  one  pair 
of  which  is  tied  in  both  graphs  and  the  other  pair  of  which  contributes 
one  to  S  by  virtue  of  the  fact  that  the  edges  joining  the  pair  are 
oriented  the  same  in  both  graphs,  and  those  pairs  of  points  both  pairs 
of  which  contribute  to  S  because  of  the  agreement  of  the  orientations 
of  the  edges  joining  both  pairs  in  the  two  graphs. 

cov  (X.  .  X„  )  =  E(X.  .  X„  )  -  E*(X.  .)  , 

lj  xik  1J  Jok  xj 

from  the  definition. 

Therefore,  (see  e.g.  Feller  [26],  p.  216) 


E(X.  .  Xfll  ) 
V  ij  ik; 


(2.17.3) 


cj^(t)  = 


(mn  Y 


cr2(s)  = 


[mn  a^(X.  .)  +  2(^)cov(X. 

ij  2  ij 


(ron)' 


X«c» 


If  t  =  u  ,  then  t  measures  the  correlation  between  two 


- 

~~ 

j 

-■ 

-  -I  t  . 

- 

• 

» 

J  .  . 
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incomplete  m  by  n  tournaments,  each  of  which  contains  mn-t  oriented 
edges.  Carrying  through  the  substitutions  for  this  case  gives,  barring 
errors,  that 


(2.17.4)  E(t)  = 


t(5t-2mn) 

(mn)2 


and 


o-2(r) 


2  2  4 

(mn)  (mn-1 ) (mn-2t)  +  mnt  ( lOmn- l8t- 1 )  +  9t 


(mn)  (mn-l) 


In  particular,  if  t  =  u  =  0  this  corresponds  to  the  case 
considered  originally. 

Kendall  has  also  considered  what  can  be  thought  of  as  being  a 
coefficient  of  agreement  between  R  ordinary  tournaments  on  n  points. 
This  can  easily  be  extended  to  bipartite  tournaments  where,  for 
simplicity,  we  restrict  ourselves  to  the  case  where  there  are  no  ties 
permitted. 

Let  there  be  given  R  ,  an  arbitrary  integer  greater  than  one, 
m  by  n  tournaments  all  of  whose  point  sets  are  labelled  P  =  (P^,  ...,  P^} 
and  Q  =  {Q^,  ...,  Q^).  Let  7  denote  the  number  of  these  tournaments 
in  which  P^  -*Q^.  >  for  i  =  . ..,  and  j  =  . ..,  n.  Then 

I  ■  I  ^J>  *  - 

i;  j 

is  the  total  number  of  pairs  of  agreements  in  the  orientation  of 
corresponding  edges  among  the  R  tournaments.  A  coefficient  of 


agreement,  u 


can  be  defined  by 
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(2.17.5)  u  =  2  L  -  1  , 


which  is  the  same  as  t  when  R  =  2. 


From  the  definition  it  follows  that  the  minimum  value  u  may 
assume,  if  R  >  2  ,  is  -l/(R-l)  or  -l/R  ,  according  as  to  whether 
R  is  even  or  odd,  respectively,  since  it  is  impossible  for  there  to  be 
complete  disagreement  as  regards  the  orientation  of  the  edges  among 
more  than  two  tournaments. 

Under  our  hypothesis  of  randomness  it  can  be  shown  that 

(2.17.6)  E(^)  =  \  ^  Wn  * 

and 

(2.17.7)  o-2  (^  )  =  \  (*)  mn  . 

Also,  the  statistic 


1  /R\  R-3 

2  ”  (2}  5^2 


for  R  >  2  , 


has  as  its  first  three  moments  the  first  three  moments  of  the 
X2  -  distribution  with 

v  =  ran  R(R-l)/(R-2)2 
degrees  of  freedom. 

Except  that  (^)  is  replaced  by  mn  the  proofs  of  these  statements 
are  almost  identical  with  the  proofs  of  the  corresponding  statements  for 
ordinary  tournaments  outlined  in  Kendall  [5>]>  PP.  and  for  that 

reason  will  be  omitted. 


.  . 


■  .  .  •-  .•  ; 

.  ' 
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CHAPTER  III 

ON  RANDOM  BIGRAPHS 


3. 1  Introduction 

Given  that  a  graph  has  n  points  and  E  edges,  from  these 
facts  alone  what  can  be  inferred  about  the  structure  of  such  a  graph, 
i.e.  what  properties  does  it  possess  and  what  is  the  nature  and  number 
of  configurations  appearing  as  subgraphs  of  such  a  graph?  Many 
problems  of  this  general  type  have  been  considered  and  in  this 
concluding  chapter  we  treat  a  few  of  these  and  related  types  for 
bigraphs. 

3.2  On  chromatic  bigraphs 

Let  there  be  given  two  distinct  sets  of  n  points  each  such 
that  joining  each  pair  of  points  not  both  in  the  same  set  is  an  edge 
which  is  coloured  either  red  or  blue,  say,  but  not  both.  In  such  a 
configuration,  called  a  chromatic  bigraph,  what  is  a  lower  bound  for 
the  total  number  of  monochromatic  quadrilaterals,  i.e.  cycles  of 
length  four,  the  four  edges  of  which  are  of  the  same  colour? 

It  is  easily  seen  how  there  may  be  set  up  a  one-to-one 
correspondence  between  chromatic  bigraphs  and  bipartite  tournaments. 

In  particular,  with  every  chromatic  bigraph  may  be  associated  an 
adjacency  matrix  of  0* s  and  l*s. 

For  ordinary  chromatic  graphs  the  corresponding  problem  with 
respect  to  triangles  was  solved  by  Goodman  [32]  and  later  by  Sauve  [69]. 


;  •  '•  ... 
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In  that  case  the  number  of  monochromatic  triangles  may  be  expressed  as 
a  function  of,  say,  the  number  of  red  edges  incident  on  each  point, 
but  the  two  matrices  given  in  §2.12  can  be  interpreted  as  corresponding 
to  chromatic  bigraphs  each  of  which  has  the  same  number  of  red  edges 
incident  on  each  point  but  which  do  not  contain  the  same  number  of 
monochromatic  quadrilaterals. 

For  convenience  we  may  equivalently  interpret  our  problem  in 
terms  of  n  by  n  matrices  each  of  whose  entries  is  either  0  or 
1  and  where,  for  any  such  matrix,  R(n)  and  B(n)  denote  the  number 
of  2  by  2  minors  all  of  whose  entries  are  l,s  and  0* s,  respectively. 

We  will  prove  the  following 


Theorem  3.2.1. 


R(n)  +  B(n)  > 


2u2(u-1)(4u-3) 
2u5(4u-3) 
u  (2u+1)(4u2-u-1) 
u2(2u+1)(4u+3)  , 


if  n  =  4u  ; 

if  n  =  4u+l  ; 

if  n  =  4u+2  ; 

if  n  =  4u+3  ) 


where  u  is  a  nonnegative  integer. 

In  any  n  by  n  matrix  of  O'  s  and  1*  s  ,  A  =  |ja_||  ,  let  r^ 

denote  the  ith  row  sura.  Then  the  number  of  pairs  of  distinct  columns, 

k  and  Jt,  ,  for  which  there  exists  a  row  i  such  that  a.,  =  a.  „  is, 

lk  i  £  ’ 

counting  multiplicities, 


(3.2.1)  ^  Kg1)  +  (V1)1  • 

1=1 


If  the 


pairs  of  distinct  columns  have  been  numbered  let 


. 


. 
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t  and  h  denote  the  number  of  tiroes  the  vth  pair  has  been  counted 
v  v 

in  (5.2.1)  with  a^  =  ai^  =  1  and  a  ^  =  a  ^  =  0  ,  respectively. 


Then 


(") 


(3.2.2)  £  CV  -  I  1  hv  ■  I  (V1)  ’ 


V=1 


i=l 


v=l 


i=l 


and 


(n) 

'2' 


(3.2.5)  R(n)  +  B(n)  =  ^  [(/)  +  (/)]  . 


v=l 


n 


Hence,  to  minimize  R(n)  +  B(n)  we  should  have  ^  r^  as 

A  i=1 

nearly  equal  to  ^  (n-r^)  as  possible,  the  r^’s  as  nearly  equal  to 

i=l 

each  other  as  possible,  the  t^’s  as  nearly  equal  to  each  other  as 
possible,  and  similarly  for  the  h^’s. 

When  n  =  4u  the  smallest  possible  value  for  R(n)  +  B(n) 

would  occur  when  all  r.  =  2u  and  t  was  equal  to  u  for 

1  v 

2 

2u(5u-l)  values  of  v  and  u-1  for  the  remaining  2u  values  of 
v  ,  since 


_n 

u[2u(3u-l)]  +  (u-l)(2u2)  =  4u  (2“)  =  ^  ^  ’ 


i=l 


and  similarly  for  the  h^’s.  Hence,  for  any  matrix  in  this  case 


(3.2.4)  R(n)  +  B(n)  >  2[2u(5u-l)  Q)  +  2u2  (U‘X)] 


=  2u  (u-1)(4u-5)  . 


. 
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In  the  same  way  it  is  found  that  if  n  =  4u+2  the  minimum 

would  occur  when  all  r.  =  2u+l  ,  t  was  equal  to  u  for 

1  3  v  n 

(2u+l)(3u+l)  values  of  v  and  u+1  for  the  remaining  u(2u+l) 
values,  and  similarly  for  the  h^'s.  In  this  case 

(5.2.5)  R(n)  +  B(n)  >  2[ (2u+l)(3u+l)  (“)  +  u(2u+l)  (U+1)] 

=  u(2u+1)(4u2-u-1)  . 


When  n  =  4u+3  it  is  obviously  impossible  to  have 
n  n 


^  r_^  =  ^  (n-r^).  The  nearest  to  equality  that  these  two  could 

i=l  i=l 

come  would  be  if  r_^  ,  say,  was  2u+2  for  2u+2  values  of  i  and 

2u+l  for  the  remaining  2u+l  values.  The  other  conditions,  in  this 

case,  would  require  that  t^  be  equal  to  u+1  for  2(u+l)(2u+l) 

2 

values  of  v  and  u  for  the  remaining  (2u+l)  values  and  that  h 

2 

be  equal  to  u+1  for  (2u+l)  values  of  v  and  u  for  the  other 
values.  Hence,  in  this  case. 


(5.2.6)  R(n)  +  B(n)  >  (2u+l)(4u+3)  U^1)  +  ()J)]  =  u2(2u+l)(4u+3)  . 


When  n  =  4u+l  the  same  sort  of  reasoning  implies  that  the 

minimum  value  of  R(n)  +  B(n)  would  occur  when,  say,  r^  was  equal 

to  2u+l  for  2u+l  values  of  i  and  2u  for  the  remaining  values; 

2 

t^  was  equal  to  u  for  8u  +u  values  of  v  and  u+1  for  the 

2 

other  u  values*  and  when  h  was  equal  to  u  for  8u  +u  values  of 
>  v 

v  and  u-1  for  the  remaining  u  values.  As  a  lower  bound  this  gives 

(3.2.7)  R(n)  +  B(n)  >  2(8u2+u)  (")  +  u[(U^)  +  (“-1)] 


u[(4u+1)(u-1)(2u)+1]  . 


•  -  'r  ' 
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But  if  u  >  0  it  is  impossible  for  equality  ever  to  hold  here. 

To  see  this  we  notice  that,  by  symmetry,  a  matrix  for  which  R(n)  +  B(n) 

is  to  attain  the  minimum  in  (3.2.7)  must  have,  in  this  case,  2u 

2u 

columns  whose  column  sum  is  2u.  For  each  of  the  (0  )  pairs  of 

such  columns  t  is  u  or  u+1  and  h  is  u-1  or  u.  Upon 

attempting  to  construct  two  columns  in  api  4u+l  by  4u+l  matrix 

each  containing  2u  ones  and  2u+l  zeros  it  is  not  difficult  to  see 

that  none  of  these  combinations  of  t  and  h  are  possible,  and 

v  v 

that  the  next  best  combinations  from  the  point  of  view  of  minimizing 

the  sum  is  (3.2.3)  are  t  =  u  and  h  =  u+1  or  t  =  u-1  and 

h^  =  u.  For  each  h  =  u+1  ,  for  some  of  these  pairs  of  columns, 

there  must  be  an  extra  h^  =  u-1  in  order  to  preserve  equality  in 

(3.2.2)  and  a  similar  remark  can  be  made  with  respect  to  those  t^ 

which  equal  u-1.  Clearly  there  were  enough  t^’s  and  h^'s 

originally  set  equal  to  u  to  permit  all  the  necessary  changes. 

Since  (U^)  +  (Ug^)  =  2(^)  +  1  ,  we  see  that  whichever  of  these 

changes  we  make  we  are  forced  to  increase  the  sum  in  (3.2. 30  by  one 

2u 

for  each  of  the  (^  )  pairs  of  distinct  columns  whose  column  sum  is  2u. 

Adding  this  to  the  lower  bound  obtained  previously  in  (3.2.7) 
gives  the  result,  when  n  =  lj-u+1  ,  that 

(3.2.8)  R(n)  +  B(n)  >  2u:'  (4u-3)  , 

which  completes  the  proof  of  Theorem  3.2.1. 

It  would  be  of  interest  to  be  able  to  give  a  general  construction 
which  would  show  that  the  bounds  given  in  this  theorem  are  sharp  or  to 
give  larger  bounds  if  they  are  not.  That  they  are  not  far  from  being 
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2 

best  possible  is  implied  by  the  fact  that  if  each  of  the  n  edges 

is  equally  as  likely  to  be  coloured  red  as  it  is  to  be  coloured  blue 

1  n  ^ 

then  the  expected  value  of  R(n)  +  B(n)  is  —  (^)  ,  which  is  only 

slightly  larger  than  the  lower  bounds  derived  in  the  theorem. 


The  following 

9  by  9 

matrix 

1 

1 

1 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

1 

1 

0 

0 

1 

0 

0 

1 

0 

1 

0 

1 

1 

1 

0 

1 

0 

1 

0 

0 

1 

1 

A  = 

0 

1 

0 

1 

1 

0 

1 

0 

1 

0 

1 

1 

0 

0 

1 

1 

0 

1 

0 

1 

0 

0 

1 

1 

0 

1 

0 

0 

0 

1 

1 

0 

0 

1 

1 

0 

0 

0 

1 

1 

1 

1 

0 

0 

1 

contains  forty-four  2  by  2  minors  each  of  whose  entries  is  1  and 
thirty-six  2  by  2  minors  each  of  whose  entries  is  0  which  shows 
that  equality  may  occur  in  the  bounds  given  for  R(n)  +  B(n)  when 
n  =  9.  Other  examples  suffice  to  show  that  the  bounds  are  sharp  for 

n  <  9. 


The  problem  treated  here  as  well  as  the  argument  used  admit 
of  obvious  generalizations. 

Related  to  this  problem  is  one  due  to  Zarankiewicz  [93]  which 
may  be  expressed  as  follows!  Find  a  function,  A(m,nj  k , &)  ,  such 
that  every  ro  by  n  matrix  of  0's  and  l's  which  contains  more  than 
A(m, n;  k, i)  l's  contains  a  k  by  i  minor  all  of  whose  entries  are 
l's.  From  the  first  parts  of  (3.2.1)  to  (3.2.3)  it  follows  that 
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if  an  n  by  n  matrix  of  0' s  and  V s  contains  A  V s  then  the  number 


of  2  by  2  minors  all  of  whose  entries  are  l's  contained  in  the  matrix 
is  at  least 


This  can  be  solved  for  A  in  terms  of  n  to  determine  when  it  is 
positive.  Doing  so  gives  the  result  that 


(3.2.10)  A(n,n,*  2,2)  <  §  (l  +  Jkn-3)  . 


This  has  been  shown  by  Rieman  [82], 

More  generally  the  same  type  of  reasoning  can  be  extended  to 
show  that 


(3.2.11)  A(m,nj  k ,  A)  <  (k-l)n  +  (i-l)1^  n1-1/k  m  , 


which  has  been  derived,  using  a  somewhat  different  approach,  by 
Hylt£n-Cavallius  [48],  Other  partial  results  are  contained  in  papers 
by  Kovari,  S6s,  and  Turcin  [38]  and  “Gulik  [10], 

Another  related  problem  has  to  do  with  unit  graphs,  by  which  is 
meant  a  graph  in  the  Euclidean  plane  such  that  every  pair  of  points 
which  are  adjacent  are  a  unit  distance  apart,  measured  in  the 
ordinary  sense.  Moser  [67]  has  shown,  among  other  things,  that  if 
an  ordinary  graph  on  n  points  with  E  edges  is  an  unit  graph,  then 


(3.2.12)  E  <  |  (1  +  +  7  )  , 


improving  an  earlier  result  due  to  Erdos  [18], 
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For  ra  by  n  unit  bigraphs  a  similar  approach  may  be  used 
although  the  result  which  is  thus  obtained  and  (3.2.12)  are  both 
quite  likely  far  from  being  best  possible.  The  critical  step  is  to 
observe  that  for  each  of  the  (™)  pairs  of  distinct  P  points  it  is 
impossible  for  the  points  of  such  a  pair  to  be  both  joined  to  more 
than  two  Q  points  at  a  unit  distance  from  each  of  them.  This  provides 
an  upper  bound  for  the  number  of  Q  points  such  that  there  are  two  P 
points  at  a  unit  distance  from  it  which  are  adjacent  to  it?  counting 
multiplicities.  A  lower  bound  for  this  same  quantity  can  be  obtained 
in  terms  of  the  number  of  edges  the  graph  contains  by  the  means  used 
to  obtain  (3.2.1)  and  (3.2.2).  Solving  the  resulting  inequality, 
the  details  of  which  we  omit,  gives  the  result  that  it  an  ra  by  n 
unit  bigraph  contains  E  edges  then 

(3.2.15)  E  <  *+&+  . 

m  and  n  may  be  interchanged  in  this  inequality  since  the  original 
problem  is  symmetric  in  m  and  n. 

3.3  On  the  largest  monochromatic  subgraph  of  a  chromatic  bi graph 
A  set-theoretical  theorem  due  to  Ramsey  [73]  implies,  among 
other  things,  the  existence  of  a  function,  N(k, £)  ,  where  k  and  £ 
are  positive  integers,  such  that  any  ordinary  chromatic  graph  with  at 
least  N(k, £)  points  contains  either  a  complete  subgraph  on  k  points 
all  of  whose  edges  are  red  or  a  complete  subgraph  on  £  points  all  of 
whose  edges  are  blue.  Various  bounds  have  been  derived  for  N(k, i). 

See,  for  example,  Erdos  and  Szekerts  [17],  Erdos  [19]>  [20],  [21], 
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and  [2k],  and  Greenwood  and  Gleason  [31]  for  some  of  these  and  related 
results.  In  this  section  a  special  case  of  a  corresponding  problem  for 
chromatic  bigraphs  will  be  considered. 

As  in  previous  sections  it  will  be  more  convenient  to  express  the 
problem  in  terms  of  matrices.  Let  k(m,n)  be  the  largest  number,  k  , 
such  that  every  m  by  n  matrix  of  O' s  and  l's  contains  a  k  by  k 
minor  all  of  whose  entries  are  the  same.  With  no  loss  of  generality 
we  may  assume  that  m  >  n. 

Lemma  3.3.1.  k(5,5)  =  2  . 

Each  row  in  a  5^5  matrix  of  0' s  and  l's  contains  at  least 
three  identical  entries.  Hence,  there  are  at  least  three  rows  each 
of  which  contains  at  least  three  l's  ,  say.  Essentially  the  only  way 
of  filling  out  two  rows  of  such  a  matrix,  each  containing  three  l's, 
without  having  a  2  by  2  minor  involving  these  rows  all  of  whose 
entries  are  identical  is,  say,  if 

bll  =  b12  ■  bl5  =  1  '  bl4  b15  ”  0  > 

b21  =  b22  =  0  ’  b23  =  b24  =  b2J  =  1  ’ 

where  b. .  denotes  the  element  in  the  ith  row  and  ith  column  of  the 

matrix.  But  it  is  not  difficult  to  see  that  it  is  impossible  to  add 
a  third  row  containing  at  least  three  l*s  without  having  at  least  one 
2  by  2  minor  all  of  whose  entries  are  identical.  This  shows  that 
5)  >  2  ,  which  also  is  a  consequence  of  Theorem  3.2.1. 

That  k(5,5)  <2  is  shown  by  the  following  matrix! 
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1110  0 


0  0  111 


A 


10  10  1 


10  0  10 


0  10  0  1 


This  lemma  could  be  thought  of  as  being  the  bipartite  analogue 
of  the  statement  that  of  every  six  people  at  least  three  are  either 
mutual  acquaintances  or  mutual  strangers.  (See  e.g.  Goodman  [32].) 


We  now  restrict  ourselves  to  the  case  that  m,n  >  p  and  shall 


i 


further  assume  that  log  ro  =  o(n2  )  as  to  and  n  tend  to  infinity. 


If  every  m  by  n  matrix  of  0* s  and  l*s  contains  a  k  by  k 
minor  of  identical  elements  then  the  number  of  ways  of  forming  such 
a  minor  times  the  number  of  ways  of  choosing  all  the  remaining  entries 
of  the  matrix  is  at  least  as  great  as  the  total  number  of  ro  by  n 
matrices  of  0' s  and  l's.  That  is 


>  2 


,mn 


or 


> 


since  k  >  2  for  m,n  >  5 


This  implies  that 


(3.3.1)  k(ro,n)  <  log2(ron)  . 


Next  consider  an  m  by  n  matrix  of  0*s  and  l's  which  contains 


no  k+1  by  k+1  minor  all  of  whose  entries  are  identical.  Such  a  matrix 


* 

-  :■  * 

„  . 


. 


r: 

f 

: 

.  . 


-  133  - 


exists  from  the  definition  of  k(m,n).  With  no  loss  of  generality  we 
may  assume  that  this  matrix  contains  at  least  as  many  l's  as  it  does 
0's.  Hence 


I 


i=l 


r .  >  mn/2 

1  —  ' 


where  r.  denotes  the  ith  row  sum.  The  number  of  (k+l)- tuples  of 


r . 


l's  in  the  ith  row  is  (.  ^).  Then  it  must  be  that 

k+l ' 


m 


r . 


^k+1 ^  -  k^k+l^  ' 


i=l 


for  otherwise  there  would  be  at  least  one  (k+l)-tuple  of  columns  such 
that  k+l  or  more  rows  all  had  l*s  in  these  columns,  forming  a 
k+l  by  k+l  minor  of  l-’s  contrary  to  the  assumption.  The  sum  in  the 
left  member  of  this  inequality  is  minimized  when  all  the  r^  are 
equal  to  n/2.  Therefore, 


ra 


(n/2 

'■k+l 


)  <  k  ( 


n 

k+l 


)  . 


l. 

From  (3.3.1)  and  the  assumption  that  log  ro  =  o(n2 )  it 

1. 

follows  that  k  =  o(n2 )  as  ro  and  n  tend  to  infinity.  This  being 
the  case  it  follows  from  Stirling's  formula  that  for  sufficiently  large 
m  and  n  the  last  inequality  is  equivalent  to 


/  /0\k+l  k+l 

(n/2)  ^  ,  n 

ro  — 1 - — - —  <  k  - 

(k+i):  (k+i): 

or 

(3.3.2)  m  <  k  2k+1  . 


. 


_ 
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Combining  (3.3.1)  and  (3.3.^)  gives 
Theorem  3.3.1.  For  every  €  >  0  the  following  inequality  holds  for 

_i 

suff iciently  large  m  and  n  if  log  m  =  o(n2 )  as  m  and  n  tend 
to  infinity. 

(l-e)  log^m  <  k(m,n)  <  log2(ron)  . 

Various  generalizations  may  be  treated  in  a  similar  fashion. 

3.4  Threshold  functions  for  certain  subgraphs  of  bigraphs 

In  a  series  of  papers,  [22],  [23 3 ^  and  [25 D ^  Erdos  and  Renyi 
have  treated  several  questions  dealing  with  the  evolution  of  random 
graphs.  With  respect  to  some  property  a  graph  may  possess,  a  typical 
problem  is  to  determine  how  many  edges  a  g^>h  should  have  in  order 
for  it  to  be  reasonably  certain,  in  some  sense,  that  it  should  possess 
the  given  property.  Presumably,  for  each  such  theorem  there  should 
exist  a  corresponding  result  for  bigraphs.  In  this  section  we  indicate 
how  a  typical  argument  appearing  in  [23]  can  be  extended  to  treat 
bigraphs.  The  result  to  be  derived  here  is  the  bipartite  analogue 
of  Theorem  1  in  the  last  cited  paper  and  the  terminology  used  there 
will  be  retained  insofar  as  is  possible. 

An  m  by  n  bigraph  with  t  edges,  0  <  t  <  mn  ,  will  be 
denoted  by  T(m,n*  t)  and  it  is  understood  that  each  of  the  (™^) 
such  configurations  is  equally  as  likely  to  be  the  graph  under 
consideration. 

We  shall  denote  by  P^  fc(A)  the  probability  that  the  random 
graph  r(m,n*t)  possesses  a  given  property  A.  If,  for  a  given  property 
A  ,  there  exists  a  function,  A(m,n)  ,  such  that 
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0  ,  if  lim 


m,  n— >  oo 


1  ,  if  lim 

m,  n— > 00 


where  t  =  t(m,n)  ,  then  A(ro,n)  will  be  called  a  threshold  function 
of  the  property  A. 


First  it  may  be  observed  that  if  t  =  o(n/ mn/(m+n) )  it  is  very 


likely  that  the  graph  consists  of  isolated  points  and  isolated  edges 
having  no  points  in  common.  For,  the  probability  that  any  two  distinct 


edges  of  r(ro,n;  t)  have  an  endpoint  in  common  is  easily  seen  to  be 


O  O  t: 

i  _  i — - — 


~  1  -  exp  -t  ( 


b  m  n 


using  the  fact,  a  consequence  of  Stirling's  formula,  that 


The  P  and  Q  degrees  of  T(m,n*  t)  are  defined  to  be  t/ra 
and  t/n  ,  respectively.  These  are  simply  the  average  degrees  of  the 
points  in  P  and  Q  ,  respectively.  r(m,n;  t)  will  be  said  to  be 
balanced  if,  and  only  if,  the  P  and  Q  degrees  of  every  proper 
subgraph  of  F(m,n;  t)  are  less  than  or  equal  to  the  P  and  Q 
degrees,  respectively,  of  r(m,nj 


t)  itself. 
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If  a+b-l<i<  ab,  a,b  >  I  ,  let  £3  =  |3(a,bj  i)  denote 
an  arbitrary  non-empty  set  of  connected  and  balanced  a  by  b 
bigraphs  with  8  edges.  Let  there  be  B  =  B(a,b$  8)  graphs  in  £3. 
We  will  prove  the  following  resultl 


Theorem  J.k.l.  The  threshold  function  for  the  property  that  the 
random  graph  T(m,nj  t)  should  contain  at  least  one  subgraph 
isomorphic  with  some  graph  in  £3(a,b;  i)  is  ro^  n"*"  . 


Let  P(£3)  denote  the  probability  described  in  the  theorem. 


Then 

B 

(5.4.1)  P(fJ)  <  (!)  (?)  — --t—  =  0(t*  ma'e  nb_i)  , 


the  number  of  ways  of  choosing  the  a  P  points  and  b  Q  points  upon 
which  may  be  formed  one  of  B  graphs  isomorphic  with  some  graph  in  £3 
times  the  number  of  ways  of  distributing  the  remaining  edges.  The 
inequality  arises  from  the  fact  that  possibly  some  graphs  are  counted 
more  than  once  in  this  process.  Since  the  last  expression  is  o(l) 
if  t  =  o  (m^  D/^)  this  suffices  to  establish  part  of  the 

theorem  by  the  definition  of  a  threshold  function. 

Let  G  =  Gm,n,  #  B  denote  the  set  of  all  subgraphs  of  the  complete 

3.  y  D  y  XJ 

m  by  n  bigraph  which  are  isomorphic  with  some  element  of  £3(a,b;  8), 
With  any  graph  S  of  this  set  associate  a  random  variable  p(s)  which 
assumes  the  value  1  or  0  according  as  to  whether  S  is  or  is  not  a 
subgraph  of  r(ro,n;  t).  Then 
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^  E  [p(s)] 
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3D  ^  mn  ^ 


B 


a!  b' 


£-a  £-b 

m  n 


for  essentially  the  same  reasons  as  before. 


If  and  are  both  in  G  and  have  no  edges  in  common 

then  clearly 

E[|i(S1)  |i(S2)]  = 

and  the  number  of  pairs  of  such  graphs  is  not  more  than 

u”)  O  v2  • 

a  b 

On  the  other  hand  if  and  S2  contain  exactly  r  ,  1  <  r  <  £-1  , 

edges  in  common  and  u  and  v  common  P  and  Q  points,  respectively, 
then  it  follows  that 


,xm-2£\ 
{  t-2£ 


D 


/rnn-2f+r  x 

iWSj)  u(Sg)]  =  -  0[(t/mn)2i"r]  . 


Applying  to  the  intersection  of  and  the  hypothesis  that  the 
graphs  in  £  were  balanced  implies  that  u  >  a  r/i  and  v  >  b  r/i  . 
Hence,  the  total  number  of  such  pairs  of  subgraphs,  and  Sg  ,  is 
not  more  than 


00  b2 


I 


u  >  a  r /  £ 


b 


/  3.  \  /ID™3 

'u' *a-u 


>  •  i 

V  >  b  r/i 


n-b 

b-v 
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where  the  first  two  factors  represent  the  number  of  ways  of  choosing 

which  points  are  to  form  part  of  a  subgraph  isomorphic  with  some  element 

of  P  and  the  sums  represent  the  number  of  ways  of  choosing  the  points 

which  are  to  form  part  of  a  second  subgraph,  having  some  points  in 

2 

common  with  the  first,  isomorphic  with  another  element  of  3.  B  is 
an  upper  bound  to  the  number  of  ways  of  choosing  which  two  elements  of 
(3  are  to  be  involved. 


Therefore, 


(3.4.3)  e  p(s))2  <  ^  e[p(s ) ] 


SeG 


SeG 


/mn-2i\ 

+  tC>(>2  ^7  +  0 

^  t' 


£ 


£-1 


£-a  £-b 

w  ro  n 


■  .  1-a/i  1-b/ £  r 

f  /  <?■■■  tn--  ) 

r=l 


multiplying  bounds  for  the  number  of  terms  of  various  types  in  the 
expansion  by  the  expected  value  of  such  terms,  as  already  determined. 


But 


/mn-2^s 

^  t-2i' 


< 


(™) 

and  supposing  that 


/inn- i/2 
1  t-£} 
/mn\2 
'  t' 


1-a/  £  1-b/ £ 


=  CO  — ►  +  oo 


m  '  n 


gives  from  (3.4.2)  and  (3.4.3)  the  result  that 


(3.4.4)  o-' 


^  M-(S)  =  0  |  ^  E(|i(S)) 


l/oo 


SeG 


SeG 


Chebyshev’s  inequality  may  be  used  to  imply  that 


r  '■  >■;  . 

-  .  £«•:  ■:  '• 
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Ptn,n;t  {  I  E^(S))I  >  \  ^  IMS))}  =  0(l/u>)  , 

SeG  SgG  SgG 

or  that 

(3.^.5)  Pljn.t  (X  <f  X  Efc(S)))  =  °(l/u)  ' 

S€G  S€G 

But  from  (3. 4.2)  we  see  that  as  to  — >  co  so  also  does 

^  E(|i(S))  which  certainly  implies  from  (3.4.5)  that  the  probability 
SeG 

that  r(ro,nj  t)  contains  at  least  one  subgraph  isomorphic  with  some 
element  of  |3(a,b;  f,)  tends  to  one  under  the  given  assumptions.  This 
completes  the  proof  of  Theorem  3.4.1  by  the  definition  of  a  threshold 
function. 


The  following  results  are  immediate  consequences  upon  observing 
that  the  subgraphs  involved  satisfy  the  hypothesis  of  the  theorem. 


Corollary  3.4.1. 
random  graph,  r(m,n$ 
a  by  b  bigraph  is  m 


The  threshold  function  for  the  property  that  the 

t)  ,  contains  a  subgraph  which  is  a  complete 
l-l/b  nl-l/a 


Corollary  3.4.2.  The  threshold  function  for  the  property  that  the 

random  graph,  T(ro,nj  t)  ,  contains  a  subgraph  which  is  a  cycle  of 
length  2k  ,  k  >  2  ,  is  (inn)^  . 

Corollary  3.4.3.  The  threshold  function  for  the  property  that  the 

random  graph >  r(m,n;  t)  ,  contains  a  subgraph  which  is  a  path  of  length 

k-1 
2k- 1 

2k-l  is  (mn)  *  the  corresponding  function  if  the  path  is  of 

length  2k  with  k+1  P  points  is  (mn)^  m 


•;  - 
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Corollary  The  threshold  function  for  the  property  that  the 

random  graph,  T(m,nJ  t)  ,  contains  a  subgraph  which  is  a  tree  with  a 

b-1  a-1 

P  points  and  b  Q  points  is  ^a+b-1  ^a+b-1 

For  an  account  of  other  problems  dealing  with  random  graphs 
which  could  be  considered  for  bipartite  graphs  also  but  which  will  not 
be  included  here  see  Katz  [30],  Austin,  Fagen,  Penney,  and  Riordan  [ 1 ] 
and  Harris  [47]. 


3.5  On  the  number  of  dyads  in  directed  bigraphs 

In  any  directed  graph  two  distinct  points  are  said  to  form  a 
dyad  if  they  are  joined  by  two  edges  which  are  directed  in  opposite 
senses.  Interpreting  the  points  as  people  and  the  directed  edges  as 
friendship  choices,  say,  then  a  dyad  represents  a  mutual  choice  between 
two  people.  Katz  and  Wilson  [51]  have  investigated  the  number  of  dyads 
in  such  a  setting  under  a  suitable  hypothesis  of  randomness.  One 
obvious  extension  of  this  problem  is  to  assume  that  each  person  belongs 
to  one  of -two  classes,  but  not  both,  and  restricts  his  choices  to 
members  of  the  other  class.  The  tacit  assumption  is  that  no  person 
is  permitted  to  choose  another  person  more  than  once. 


In  the  terminology  of  graphs  the  problem  becomes  the  following! 

Let  there  be  given  a  directed  m  by  n  bigraph  in  which  the  points  P_^ 

and  have  outdegrees  g_^  and  b^  ,  respectively,  where 

0  <  g^^  <  n  and  0  <  <  ro  ,  for  i  =  1,  ...,  m  and  j  =  1,  . . . ,  n. 

We  will  derive  expressions  for  the  expected  value  and  variance  of  D  , 


the  number  of  dyads,  i.e.  pairs  of  points,  P^  and  ,  such  that 
->Qj  anc*  Q j  -*■  ,  in  the  graph,  under  the  hypothesis  that  all 


■ 
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such  configurations  possible  are  equally  likely  and  that  the  arrangement 
of  the  edges  issuing  from  any  one  point  is  independent  of  the  arrangement 
of  the  edges  issuing  from  any  other  point. 


Let  X  ,  i  =  1,  ...,  id  and  j  =  1,  . ..,  n  ,  denote  a  random 

variable  which  assumes  the  value  1  or  0  according  as  to  whether  the 

points  P_^  and  do,  or  do  not,  form  a  dyad.  Also,  let 

m 

^  g^  for  h  =  1,  2,  the  power  sum  symmetric  function,  and 
i=l 

let  B,  be  similarly  defined.  In  what  follows  the  range  of  summation 
h 

for  various  indices  will  be  omitted  when  it  is  clear  from  the  context 


what  it  should  be. 


(3.5.1)  E(D)  =  ^  E(X..)  =  ^ 


Si  "i 

n  m 


Gi  Bi 

n  ro 


from  the  hypothesis  and  definitions  of  the  symbols  involved. 


The  hypothesis  of  independence  may  be  used  to  imply  that 


E(X.  .  X,  )  = 

ij  ki'  n  m 


Si  °j  Sk  °Z  .  ,  ,  ,  .  /  „ 

-  •  ,  if  i  /  k  and  j  /  i 


Furthermore,  it  is  easily  seen  that 


e(x. .  X. .)  = 

ij  ii' 


b.  b, 

j£ 

m2 


i  if  i  /  ^  > 


and,  by  symmetry, 


. 
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;<xijV  = 


8i  Sk  ^2^ 


Q 


,  if  i  /  k 


n 


Finally, 


E(X. .  X. .) 
ij  ij 


g.  b  . 

1 

n  in 


Therefore, 


(3.5.2)  e(i)2)  =  E  ^ 


!l2i 

n  m 


+  2 


¥} 


gi  Sk  bj  bi 

(ran)2 


+  2 


g,  (g,.-l)  ^  b£ 


i~in  n(n-l) 
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+  2 


bi(br1)  gi  gk 


ijflc 


in 


(m-l)  nc 


Gi  Ei 

n  m 


+  2 


!  (G®-G2)(B^-B£)  (Gg-GjHB^-Bg)  (Bg-B^G^-Gg) 


(nm)‘ 


n(n-l) 


in 


m(m-l)  n' 


2 


upon  expanding  and  collecting  terms  in  the  original  expression  and  taking 
the  expected  value  of  each  term. 

For  given  values  of  the  g^'s  an<^  b/s  the  various  expressions 
appearing  may  be  computed  with  no  difficulty  which  enables  the  variance 
of  D  to  be  determined  by  using 


(3.5.3)  a2  (D)  =  E(D2)  -  [E(D)]2  . 


When  all  g.  =  g  and  all  b.  =  b  some  simplification  occurs* 
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in  particular  when  g  «  b  =  1  (3.5.1)  and  (3.P.3)  become 


(3.5.  *0  E(D)  =  1  , 


and 


(3.5.5)  -2(d)  = 


In  this  last  case;  where  the  outdegree  of  every  point  is  one, 
let  D(ro,n)  be  the  number  of  dyads  in  a  directed  m  by  n  graph  under 
the  same  assumptions  about  randomness  as  given  earlier.  Let  m  >  n. 

It  is  an  immediate  consequence  of  the  principle  of  cross-classification 
(cf.  Feller  [26],  p.  9 6)  that 


n-k 


ro-k-i  n-k-i 
n  m 


ro  n 
n  ro 


for  k  =  0,  1, 


In  particular 


n 


J=U 


which  provides  another  generalization  of  the  ’’problSme  des  recontres." 
(See  e.g.  Riordan  [84],  p.  37.) 
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